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Abstract 

This is a substantially expanded version of a chapter-contribution to The Springer Handbook of Spacetime, 
edited by Abhay Ashtekar and Vesselin Petkov, published by Springer Verlag in 2014. It introduces the 
reader to the reformulation of Einstein’s field equations of General Relativity as a constrained evolutionary 
system of Hamiltonian type and discusses some of its uses, together with some technical and conceptual 
aspects. Attempts were made to keep the presentation self contained and accessible to first-year graduate 
students. This implies a certain degree of explicitness and occasional reviews of background material. 
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1 Introduction 

The purpose of this contribution is to explain how 
the field equations of General Relativity—often 
simply referred to as Einstein’s equations —can be 
understood as dynamical system; more precisely, 
as a constrained Hamiltonian system. 

In General Relativity, it is often said, spacetime 
becomes dynamical. This is meant to say that 
the geometric structure of spacetime is encoded in 
a held that, in turn, is subject to local laws of prop¬ 
agation and coupling, just as, e.g., the electromag¬ 
netic held. It is not meant to say that spacetime as 
a whole evolves. Spacetime does not evolve, space- 
time just is. But a given spacetime (four dimen¬ 
sional) can be viewed as the evolution, or history, 
of space (three dimensional). There is a huge re¬ 
dundancy in this representation, in the sense that 
apparently very different evolutions of space rep¬ 
resent the same spacetime. However, if the result¬ 
ing spacetime is to satisfy Einstein’s equations, the 
evolution of space must also obey certain well de- 
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fined restrictions. Hence the task is to give pre¬ 
cise mathematical expression to the redundancies 
in representation as well as the restrictions of evolu¬ 
tion for this picture of spacetime as space’s history. 
This will be our main task. 

This dynamical picture will be important for 
posing and solving time-dependent problems in 
General Relativity, like the scattering of black holes 
with its subsequent generation and radiation of 
gravitational waves. Quite generally, it is a key 
technology to 

• formulate and solve initial value problems; 

• integrate Einstein’s equations by numerical 
codes; 

• characterize dynamical degrees of freedom; 

• characterize isolated systems and the associ¬ 
ation of asymptotic symmetry groups, which 
will give rise to globally conserved ‘charges’, 
like energy and linear as well as angular mo¬ 
mentum (Poincare charges). 

Moreover, it is also the starting point for the canon¬ 
ical quantization program, which constitutes one 
main approach to the yet unsolved problem of 
Quantum Gravity. In this approach one tries to 
make essential use of the Hamiltonian structure of 
the classical theory in formulating the correspond¬ 
ing quantum theory. This strategy has been ap¬ 
plied successfully in the transition from classical 
to quantum mechanics and also in the transition 
from classical to quantum electrodynamics. Hence 
the canonical approach to Quantum Gravity may 
be regarded as conservative, insofar as it tries to 
apply otherwise established rules to a classical the¬ 
ory that is experimentally and observationally ex¬ 
tremely well tested. The underlying hypothesis 
here is that we may quantize interaction-wise. This 
distinguishes this approach from string theory, the 
underlying credo of which is that Quantum Gravity 
only makes sense on the basis of a unified descrip¬ 
tions of all interactions. 

Historically the first paper to address the prob¬ 
lem of how to put Einstein’s equations into the 
form of a Hamiltonian dynamical system was 


Dirac’s [M] from 1958. He also noticed its con¬ 
strained nature and started to develop the cor¬ 
responding generalization of constrained Hamilto¬ 
nian systems in |53j and their quantization [55) . 
On the classical side, this developed into the more 
geometric Dirac-Bergmann theory of constraints 
EH and on the quantum side into an elaborate 
theory of quantization of systems with gauge re¬ 
dundancies; see [80] for a comprehensive account. 

Dirac’s attempts were soon complemented by an 
extensive joint work of Richard Arnowitt, Stanley 
Deser, and Charles Misner - usually and henceforth 
abbreviated by ADM. Their work started in 1959 
by a paper of the first two of these authors and 
continued in the series ll]ilS]l9]il7][IO][n] 
mmmm of 12 more papers by all three. A 
comprehensive summary of their work was given in 
1962 in [TB], which was republished in 2008 in [17) : 
see also the editorial note [Ml with short biogra¬ 
phies of ADM. 

A geometric discussion of Einstein’s evolution 
equations in terms of infinite-dimensional symplec- 
tic geometry has been worked out by Fischer and 
Marsden in EH; see also their beautiful summaries 
and extended discussions in and [SB]. More 
on the mathematical aspects of the initial-value 
problem, including the global behavior of gravita¬ 
tional fields in General Relativity, can be found 
in |3S|, |3S], and [106) . Modern text-books on the 
3-1-1 formalism and its application to physical prob¬ 
lems and their numerical solution-techniques are 
[27l [77] . The Hamiltonian structure and its use in 
the canonical quantization program for gravity is 
discussed in (MUSI (Ml dS]. 

2 Notation and conventions 

From now on “General Relativity” will be abbrevi¬ 
ated by “GR”. Spacetime is a differentiable man¬ 
ifold M of dimension n, endowed with a metric 
g of signature (e,-f, • • • ,-|-). In GR n = 4 and 
e = — 1 and it is implicitly understood that these 
are the “right” values. However, either for the 
sake of generality and/or particular interest, we 
will sometimes state formulae for general n and 
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£, where usually n > 2 (sometimes n > 3) and 
either e = — 1 (Lorentzian metric) or £ = +1 (Rie- 
mannian metric; also called Euclidean metric). 

The case e = 1 has been extensively considered 
in path-integral approaches to Quantum Gravity, 
then referred to as Euclidean Quantum Gravity. 

The tangent space of M at point p £ M will be 
denoted by TpM, the cotangent space by T* M, and 
the tensor product of u factors of TpM with d fac¬ 
tors of T*M by T^M. (Mnemonic in components: 
u = number of indices “upstairs”, d = number of 
indices “downstairs”.) An element T in is 

called a tensor of contravariant rank u and covari¬ 
ant rank d at point p, or simply a tensors of rank 
(u,d) at p. T is called contravariant if d = 0 and 
M > 0, and covariant if u = 0 and d > 0. A tensor 
with u > 0 and d > 0 is then referred to as of mixed 
type. Note that TpM = Tp^M and T^M = Tp^M. 
The set of tensor fields, i.e. smooth assignments of 
an element in T^^M for each p G M, are denoted 
by TT^M. Unless stated otherwise, smooth means 
C°°, i.e. continuously differentiable to any order. 
For t G TT^M we denote by tp G T^'^M the eval¬ 
uation of t at p G M. C°°[M) denotes the set of 
all C°° real-valued functions on M, which we often 
simply call smooth functions. 

If / : M —)■ A^ is a diffeomorphism between 
manifolds M and N, then /*p : TpM —)• rj(p)A^ 
denotes the differential at p. The transposed (or 
dual) of the latter map is, as usual, denoted by 
/; : ^ T*M. If a: is a vector field 

on M then /*X is a vector field on N, called 
the push forward of AT by /. It is defined by 
if*X)q := /*/-i( 9 )^/-i(,), for all q G N. If a 
is a CO- vector field on N then f*a is a co-vector 
field on M, called the pull back of a by /. It is 
defined by {f*a)p := a/(p) o /*p, for all p G M. For 
these definitions to make sense we see that / need 
generally not be a diffeomorphism; M and N need 
not even be of the same dimension. More precisely, 
if q: is a smooth field of co-vectors that is defined at 
least on the image of / in N, then f*a, as defined 
above, is always a smooth field of co-vectors on M. 
However, for the push forward /, of a general vec¬ 
tor field on M to result in a well defined vector field 
on the image of / in we certainly need injectivity 


of /. If / is a diffeomorphism we can not only push- 
forward vectors and pull back co-vectors, but also 
vice versa. Indeed, if U is a vector field on N one 
can write f*Y := (/“^)*F and call it the pull back 
of Y by the diffeomorphism /. Likewise, if /I is a 
co-vector field on M, one can write /*/? := 
and call it the push forward of j3. In this fashion 
we can define both, push-forwards and pull-backs, 
of general tensor fields T G TT^M by linearity and 
applying /* or /* tensor-factor wise. 

Note that the general definition of metric is as 
follows: g G TT^M, such that Pp is a symmet¬ 
ric non-degenerate bilinear form on TpM. Such 

a metric provides isomorphisms (sometimes called 
the musical isomorphisms) 

b : TpM -G T*M 

:=g{X,-), (la) 

tt : T;M -g TpM 

oj ^ ■.= \>~^[uj). (lb) 

Using K we obtain a metric gfi^ on T^M from the 
metric gp on TpM as follows: 

^(wi,W2) := Pp(w5,a;^) = a;i(a;^). (2) 

We also recall that the tensor space Tp\M is nat¬ 

urally isomorphic to the linear space End(TpM) 
of all endomorphisms (linear self maps) of TpM. 
Hence it carries a natural structure as associative 
algebra, the product being composition of maps 
denoted by o. As usual, the trace, denoted Tr, 
and the determinant, denoted det, are the natu¬ 
rally defined real-valued functions on the space of 
endomorphisms. For purely co- or contravariant 
tensors the trace can be defined by first applying 
one of the isomorphisms 0 . In this case we write 
Trp to indicate the dependence on the metric g. 

Geometric representatives of curvature are of¬ 
ten denoted by bold-faced abbreviations of their 
names, like Riem and Weyl for the (covariant, i.e. 
all indices down) Riemann and Weyl tensors, Sec 
for the sectional curvature, Ric and Ein for the 
Ricci and Einstein tensors. Seal for the scalar cur¬ 
vature, and Wein for the Weingarten map (which 
is essentially equivalent to the extrinsic curvature). 
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This is done in order to highlight the geometric 
meaning behind some basic formulae, at least the 
simpler ones. Later, as algebraic expressions be¬ 
come more involved, we will also employ the stan¬ 
dard component notation for computational ease. 

3 Einstein’s equations 

In n-dimensional spacetime Einstein’s equations 
form a set of ^n{n + 1) quasi-linear partial dif¬ 
ferential equations of second order for |n(n -I- 1) 
functions (the components of the metric tensor) 
depending on n independent variables (the coor¬ 
dinates in spacetime). At each point of spacetime 
(event) they equate a purely geometric quantity to 
the distribution of energy and momentum carried 
by the matter. More precisely, this distribution 
comprises the local densities (quantity per unit vol¬ 
ume) and current densities (quantity per unit area 
and unit time) of energy and momentum. The ge¬ 
ometric quantity in Einstein’s equations is the Ein¬ 
stein tensor Ein, the matter quantity is the energy- 
momentum tensor T. Both tensors are of second 
rank, symmetric, and here taken to be covariant 
(in components: “all indices down”). Their num¬ 
ber of independent components in n spacetime di¬ 
mensions is ^n{n + 1) 

Einstein’s equations (actually a single tensor 
equation, but throughout we use the plural to 
emphasize that it comprises several component 
equations) state the simple proportionality of Ein 
with T 

Ein = kT , (3) 

where k denotes the dimensionful constant of pro¬ 
portionality. Note that no explicit reference to the 
dimension n of spacetime enters ([^, so that even 
if n ^ 4 it is usually referred to as Einstein’s equa¬ 
tions. We could have explicitly added a cosmolog¬ 
ical constant term gA on the left-hand side, where 
A is a constant the physical dimension of which is 
the square of an inverse length. However, as long 
as we write down our formulae for general T we 
may absorb this term into T where it accounts for 
a contribution Ta = —gA/K. This has to be kept 


in mind when explicit models for T are used and 
when we speak of “vacuum”, which now means: 

Tvacuum = Ta := -K~^gA. (4) 

The signs are chosen such that a positive A ac¬ 
counts for a positive energy density and a negative 
pressure if the spacetime is Lorentzian (e = —1). 

There is another form of Einstein’s equations 
which is sometimes advantageous to use and in 
which n explicitly enters: 

Ric = «(T-;^5Trg(T)). (5) 

These two forms are easily seen to be mathemati¬ 
cally equivalent by the identities 

Ein = Ric — igTrg(Ric), (6a) 

Ric = Ein - ;^ 5 Trg(Ein). (6b) 

With respect to a local field of basis vectors 
{eo,ei,-- - ,e„_i} we write Ein(e^,e,.) =: 
T(6^,6^.) —. and R.ic(ep,et/) —. Then 

(|^ and ([^ take on the component forms 

G^, = kT^, (7) 

and 

R/ii/ = K (8) 

respectively. Next we explain the meanings of the 
symbols in Einstein’s equations from left to right. 

3.1 What aspects of geometry? 

The left-hand side of Einstein’s equations com¬ 
prises certain measures of curvature. As will be 
explained in detail in Sectionj^ all curvature in¬ 
formation in dimensions higher than two can be 
reduced to that of sectional curvature. The sec¬ 
tional curvature at a point p € M tangent to 
spanjX, F} C TpM is the Gaussian curvature at 
p of the submanifold spanned by the geodesics in 
M emanating from p tangent to spanjAi, F}. The 
Gaussian curvature is defined as the product of two 
principal curvatures, each being measured in units 
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of an inverse length (the inverse of a principal ra¬ 
dius). Hence the Gaussian curvature is measured 
in units of an inverse length-squared. 

At each point p in spacetime the Einstein and 
Ricci tensors are symmetric bilinear forms on TpM. 
Hence Einp and RiCp are determined by the values 
Einp(lE,lE) and RiCp(W,VE) for all W G TpM. 
By continuity in W this remains true if we restrict 
W to the open and dense set of vectors which are 
not null, i.e. for which g{W, W) ^ 0. As we will 
see later on, we then have 


Ni 

Ein(lE, W) = -g{W, W) ^ Sec, (9) 

±w 

N2 

Ric(VE,tE) = -h 5 (VE, IE) ^ Sec. (10) 

lirv 

For the Einstein tensor the sum on the right-hand 
side is over any complete set of Ni = ^{n—l){n—2) 
sectional curvatures of pairwise orthogonal planes 
in the orthogonal complement of W in TpM. For 
the Ricci tensor it is over any complete set of 
N 2 = n—1 sectional curvatures of pairwise orthog¬ 
onal planes containing W. If IE is a timelike unit 
vector representing an observer, Ein(lE, IE) is sim¬ 
ply (—e) times an equally weighted sum of space¬ 
like sectional curvatures, whereas Ric(IE, IE) is e 
times an equally weighted sum of timelike sectional 
curvatures. In that sense we may say that, e.g., 
Ein(IE, IE) at p G M measures the mean Gaussian 
curvature of the (local) hypersurface in M that is 
spanned by geodesics emanating from IE orthogo¬ 
nal to IE. It, too, is measured in units of the square 
of an inverse length. 

3.2 What aspects of matter? 

Now we turn to the right-hand side of Einstein’s 
equations. We restrict to four spacetime dimen¬ 
sions, though much of what we say will apply ver¬ 
batim to other dimensions. The tensor T on the 
right-hand side of ([^ is the energy-momentum ten¬ 
sor of matter. With respect to an orthonormal 
basis {eo, ei, • • • , e„_i} with timelike cq the com¬ 
ponents Tpy := T(ep,ej/) form a symmetric 4x4 


matrix, which we represent as follows by splitting 
off terms involving a time component: 


T — 

fll/ — 




( 11 ) 


Here all matrix elements refer to the matter’s en¬ 
ergy momentum distribution relative to the rest 
frame of the observer who momentarily moves 
along eo (i.e. with four-velocity u = ccq) and uses 
the basis { 61 , 62 , 63 } in his/her rest frame. Then 
£ = Too is the energy density, S = (si,S 2 ;S 3 ) the 
(components of the) energy current-density, i.e. en¬ 
ergy per unit surface area and unit time interval, 
A4 the momentum density, and finally T^n the 
(component of the) momentum current-density, i.e. 
momentum per unit of area and unit time interval. 
Note that symmetry T^^ = T^p implies a simple re¬ 
lation between the energy current-density and the 
momentum density 


S = c^M. 


( 12 ) 


The remaining relations T^n = Tnm express equal¬ 
ity of the m-th component of the current density 
for n-momentum with the n-th component of the 
current density for m-momentum. Note that the 
two minus signs in front of the mixed components 
of 0 would have disappeared had we written 
down the contravariant components T^'^. In flat 
spacetime, the four equations d^T^^, express the 
local conservation of energy and momentum. In 
curved spacetime (with vanishing torsion) we have 
the identity (to be proven later; compare (90b)) 


= 0 (13) 

implies via (Q 

V'^T^, = 0, (14) 

which may be interpreted as expressing a local con¬ 
servation of energy and momentum for the matter 
plus the gravitational field, though there is no such 
thing as a separate energy-momentum tensor on 
spacetime for the gravitational field. 

Several positivity conditions can be imposed 
on the energy momentum tensor T. The sim¬ 
plest is known as weak energy-condition and reads 
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T{W, W) > 0 for all timelike W. It is equivalent to 
the requirement that the energy density measured 
by any local observer is non negative. For a per¬ 
fect fluid of rest-mass density p and pressure p the 
weak energy-condition is equivalent to both con¬ 
ditions p > 0 and p > —c^p. The strong energy- 
condition says that (T — ipTrg(T)) (VF, kF) > 0 
again for all timelike W. This neither follows nor 
implies the weak energy-condition. For a perfect 
fluid it is equivalent to both conditions p > —c^p 
and p > —(?pI?), i.e. to the latter alone if p is 
positive and to the former alone if p is negative 
(which is not excluded here). Its significance lies 
in the fact that it ensures attractivity of gravity 
as described by Einstein’s equations. It must, for 
example, be violated if matter is to drive infla¬ 
tion. Note that upon imposing Einstein’s equa¬ 
tions the weak and the strong energy-conditions 
read Ein(IF, W) > 0 and Ric(kF, W) > 0 respec¬ 
tively. From © and ( [To| we can see that for fixed 
W these imply conditions on complementary sets of 
sectional curvatures. For completeness we mention 
the condition of energy dominance, which states 
that T(IF,kF) > |T(X, X)| for any pair of or¬ 
thonormal vectors IF, X where IF is timelike (and 
hence X is spacelike). It is equivalent to the weak 
energy-condition supplemented by the requirement 
that (iwT)® be non spacelike for all timelike IF. 
The second requirement ensures locally measured 
densities of energy currents and momenta of matter 
to be non spacelike. 


2013) known with a relative standard uncertainty 
of 1.2 X 10“"^ and is thus by far the least well 
known of the fundamental physical constants, c = 
299 792 458 m • s“^ is the vacuum speed of light 
whose value is exact, due to the Si-definition of 
meter (“the meter is the length of the path trav¬ 
eled by light in vacuum during a time interval of 
1/299 792 458 of a second”). 

The physical dimension of k is 
time^/(mass • length), that is in STunits 
s^ • kg“^ • m“^ or m“^/(J • m“^), where J = 
Joule = kg • m^ • s“^. It converts the common 
physical dimension of all components T/jy, which 
is that of an energy density (Joule per cubic 
meter in STunits) into that of the components 
of Ein, which is that of curvature (in dimension 
> 2), i.e., the square of an inverse length (inverse 
square-meter in STunits). 

If we express energy density as mass density 
times c^, the conversion factor is kc^ = SttGIc^. 
It can be expressed in various units that give a 
feel for the local “curving power” of mass-densities. 

For that of water, pw ~ 10^ kg • m“^, and nu¬ 
clear matter in the core of a neutron star (which 
is more than twice that of atomic nuclei), p^ ~ 
5 X 10^^ kg • m“^, we get, respectively: 


3.3 How do geometry and matter 
relate quantitatively? 


We return to Einstein’s equations and finally dis¬ 
cuss the constant of proportionality k on the right- 
hand side of Its physical dimension is that of 
curvature (m^ in SI units) divided by that of en¬ 
ergy density {J■m~^ in SI units, where J = Joule). 
It is given by 


87rG 


2.1 X 


J ■ 


t-3 ’ 


(15) 


where G « 6.67384(80) x IQ-^m^ • kg-^ • s-^ is 
Newton’s constant. It is currently (March 


where AU = 1.5 x 10^^ m is the astronomical unit 
(mean Earth-Sun distance). Hence, roughly speak¬ 
ing, matter densities of water cause curvature radii 
of the order of the astronomical unit, whereas the 
highest known densities of nuclear matter cause 
curvature radii of tens of kilometers. The curva¬ 
ture caused by mere mass density is that expressed 
in Ein(IF, IF) when IF is taken to be the unit time¬ 
like vector characterizing the local rest frame of the 
matter: It is a mean of spatial sectional curvatures 
in the matter’s local rest frame. Analogous inter¬ 
pretations can be given for the curvatures caused 
by momentum densities (energy current-densities) 
and momentum current-densities (stresses). 
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3.4 Conserved energy-momentum 
tensors and globally conserved 
quantities 


In this subsection we briefly wish to point out 
that energy-momentum tensors T whose diver¬ 
gence vanishes (14) give rise to conserved quanti¬ 
ties in case the spacetime (M, g) admits non-trivial 
isometries. We will stress the global nature of these 
quantities and clarify their mathematical habitat. 

Conservation laws for the matter alone result 
in the presence of symmetries, more precisely, if 
Killing fields for {M,g) exist. Recall that a vector 
field V is called a Killing field iff Lyg = 0, where 
Ly is the Lie derivative with respect to V. Recall 
that the Lie derivative can be expressed in terms of 
the Levi-Civita covariant derivative with respect to 
g, in which case we get the component expression: 


(Lvff);.. = + V.R;, = 0. (17) 

We consider the one-form Jy that results from 
contracting T with V: 

Jy ■= lyT = V^T^^dx''. (18) 

As a result of Killing’s equation 0 it is divergence 
free, 

V^JO = 0. (19) 

This may be equivalently expressed by saying that 
the 3-form -kJy, which is the Hodge dual of the 
1 -form Jy, is closed: 


d*Jy = 0. (20) 


isometries. We will discuss general Lie-group ac¬ 
tions on manifolds in the Appendix at the end 
of this contribution, containing detailed proofs of 
some relevent formulae. But in order not to inter¬ 
rupt the argument too much, let us recall at this 
point that an action of G on M is a map 


$ : G X M ^ M, 

(ff, m) 1 -^ $( 5 , to) = ^'^(to) , 

which satisfies 


= Id 


M : 


o = 4> 


gh ■ 


( 22 a) 


( 22 b) 

( 22 c) 


Here e G G denotes the neutral element, Wm the 


identity map on M, and equation (22c) is valid for 


any two elements g, h of G. In fact, equation (22c) 


characterizes a left action. In contrast, for a right 
action we would have ^hg instead of on the 
right-hand side of (22c). Moreover, as the group 


acts by isometries for the metric g, we also have 
^h9 = 9 ^11 h G G. 

Now, this action defines a map, V, from Lie(G), 
the Lie algebra of G, into the vector fields on M. 
The vector field corresponding to Af S Lie(G) is 
denoted . Its value at a point to S M is defined 
by 

<i)(exp(tX), to) . (23) 


vy™) := ^ 


t=0 


From this it is obvious that V : Lie(G) 
is linear. Moreover, one may also show (compare 


TT^M 


{ 382b) in Appendix) that this map is a Lie anti¬ 
homomorphism, i.e. that 


Integrating *</y over some 3-dimensional subman¬ 
ifold E results in a quantity 

Q[K,S]:=/*Jy (21) 


which, because of ( 20 ), is largely independent of 


E. More precisely, if H C M is an oriented domain 
with boundary (3f2 = Ei —E 2 , then Stokes’ theorem 
gives Q[V, El] = Q[V, E 2 ]. 

Suppose now that V arises from a finite¬ 
dimensional Lie group G that acts on {M,g) by 


ring 


(24) 


(As shown in the Appendix, a right action would 
have resulted in a proper Lie homomorphism - see 
( |382a ) -, i.e. without the minus sign on the right- 
hand side, which however is not harmful.) The left 
action of G on M extends to a left action on all 
tensor fields by push forward. In particular, the 
push forward of by has a simple expression 


(see (383a) in Appendix) : 


= yAdg(X) 


(25) 
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where Ad denotes the adjoint representation of 
G on Lie(G). In fact, relation (25) can be di¬ 


nature. In any case, we assume the isometric ac¬ 


rectly deduced from definition (23). Indeed, writ¬ 


ing $(( 7 ,p) = g-p for notational simplicity, we have 
(see Appendix for more explanation) 


_ d 

dt 
_ d 
dt 








(5 exp(tA:) • p) 

{g exp{tX)g-^g-p) 
(exp{tAdg{X)) ■ (g-p) 




(26) 


This leads to (25) which we shall use shortly. 


Returning to the expression (21) we see that, for 


fixed E, it becomes a linear map from Lie(G) to K: 

9JI : Lie(G) ^ K, m{X) := Q[V^. (27) 

Hence each hypersurface E defines an element 9Jt G 
Lie*(G) in the vector space that is dual to the Lie 
algebra, given that the integral over E converges. 
This is the case for spacelike E and energy momen¬ 
tum tensors with spatially compact support (or at 
least sufficiently rapid fall off). The same argument 
as above using Stokes’ theorem and (20) then shows 


that is independent of the choice of spacelike 
E. In other words, we obtain a conserved quantity 
fUt G Lie*(G) for G-symmetric spacetimes (M,g) 
and covariant divergence free tensors T. 

So far we considered a fixed spacetime (M, g) and 
a fixed energy-momentum tensor T, both linked by 
Einstein’s equations. In this case the vanishing di¬ 


vergence (141 is an integrability condition for Ein¬ 
stein’s equation and hence automatic. However, 
it is also of interest to consider the more general 
case where (M, g) is merely a background for some 
matter represented by energy-momentum tensors 


T[^], all of which are divergence free (14) with re¬ 
spect to the background metric g. Note that we 
do not assume {M,g) to satisfy Einstein’s equa¬ 
tions with any of the T[p,] on the right-hand side. 
The /i stands for some matter variables which may 
be fundamental fields and/or of phenomenological 


tion (22 1 to extend to an action of G on the set of 
matter variables p, which we denote by /i 1 —>■ 
like the push-forward on tensor fields. This is also 
meant to indicate that we assume this to be a left 
action, i.e. $g* o 

We regard the energy-momentum tensor T as a 
map from the space of matter variables to the space 
of symmetric second-rank covariant tensor fields on 
M. We require this map to satisfy the following 
covariance property: 


T[<l>g,^] = := 4>* iT, 


(28) 


where is the ordinary push-forward of the ten¬ 
sor T. Since we take T to be covariant, its push 
forward is the pull back by the inverse diffeomor- 


phism, as indicated by the second equality in (28). 


For each specification p of matter variables we 
can compute the quantitty Q\y^p] as in ( 21 ). 


Note that we now indicate the dependence on p 
explicitly. We are interested in computing how Q 
changes as p is acted on by g S G. This is done as 
follws: 

Q[E^,E,$g*/r] = [ *ivxT[^g^p] 

Jy. 

J 2 

= / i(x)T[/r] 

d<E.^_i(S) 

(29) 


Here we used (28) in the second equality, the gen¬ 


eral formula ivf*T = f*{if^vT) (valid for any dif- 
feomorphism /, vector field V, and covariant ten¬ 


sor field T) in the third equality, (25) in the fourth 
equality, the formula *f*F = f**F in the fifth 





















equality (valid for any orientation preserving isom¬ 
etry / and any form-field F] here we assume M 
to be oriented), and finally the general formula for 
the integral of the pull back of a form in the sixth 
equality. 

Our final assumption is that Q does not de¬ 
pend on which hypersurface d>g(I]) it is evaluated 


on. Since we assume (14) this is guaranteed if all 


$g(E) are in the same homology class or, more 
generally, if any two hypersurfaces E and $g(E) 
are homologous to hypersurfaces in the comple¬ 
ment of the support of T. A typical situation 
arising in physical applications is that of a source 
T[^] with spatially compact support; then any two 
sufficiently extended spacelike slices through the 
timelike support-tube of T[/r] is homologous to 
the timelike cylindrical hypersurface outside this 
support-tube. In this case we infer from ([2^ that 




Recall from ( [2^ that for fixed E and T we have 
51t S Lie*(G). Given the independence on E and 
the depencence of T on /r, we now regard 911 as a 
map from the matter variables /r to Lie*(G). This 
map may be called the momentum map. (Compare 
the notion of a momentum map in Hamiltonian 
mechanics; cf. Section 111 ) Equation ( |^ then just 
states the Ad*-equivariance of the momentum map: 


91lo$g, = Ad;o911. (31) 

Here Ad* denote the co-adjoint representation of 
G on Lie(G), which is defined by Ad* (a) = a o 
Adg-i. From all this we see that the conserved 
“momentum” that we obtain by evaluating DJI on 
the matter configuration /r is a conserved quantity 
that is globally associated to all of spacetime, not 
a particular region or point of it. It is an element 
of the vector space Lie*(G) which carries the co- 
adjoint representation of the symmetry group G. 


In particular this applies to Special Relativ¬ 
ity, where M is the four-dimensional real affine 
space with associated (four-dimensional real) vec¬ 
tor space V and g a bilinear, symmetric, non¬ 
degenerate form of signature (—, -k, +, -k) [the sig¬ 


nature does not matter in what follows]. The lin¬ 
ear isometries of {V, rf) form the Lorentz group 
Lor C GL{V) and the isometries G of {M,g) 
can be (non-naturally) identified with the semi- 
direct product V X Lor, called the Poincare group, 
Poin. Using g we can identify Lie*(Pom) with 
U©(UAU). The co-adjoint action of (a. A) G Poin 
on (/, P) G Lie* (Poin) is then given by 

(/, F)= (a/, (A0 A)F-aA Af) . (32) 

Note that, e.g., the last term on the right hand side 
includes the law of change of angular momentum 
under spatial translations. In contrast, the adjoint 
representation on Lie(Poin), the latter also identi¬ 
fied with U © (U A U), is given by 


Ad(,,^)(/,P) = (a/ - ((A© A)P)a, (A©A)P) , 

(33) 

where the application of an element in U A U 
to an element in V is given by (u A v)(w) := 
ug(v,w) — vg(u,w), and linear extension. Note 
the characteristic difference between ( [3^ and ( [3^ , 
which lies in the different actions of the subgroup 
of translations, whereas the subgroup of Lorentz 
transformations acts in the same fashion. Physical 


momenta transform as in (32), as already exempli¬ 


fied by the non-trivial transformation behavior of 
angular momentum under spatial translations. For 
a detailed discussion of the proper group-theoretic 
setting and the adjoint and co-adjoint actions, see 
the recent account m 


4 Spacetime decomposition 

In this section we explain how to decompose a given 
spacetime (M, g) into “space” and ‘time”. For this 
to be possible we need to make the assumption that 
M is diffeomorphic to the product of the real line 
K and some 3-manifold E: 

M ^ K X E . (34) 
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This will necessarily be the case for globally hy¬ 
perbolic spacetimes, i.e. spacetimes admitting a 




Cauchy surface [5S] . We assume E to be orientable, 
for, if it were not, we could take the orientable 
double cover of it instead. Orientable 3-manifolds 
are always parallelizable m , i.e. admit three 
globally defined and pointwise linearly independent 
vector fields. This is equivalent to the triviality of 
the tangent bundle. In the closed case this is known 
as Stiefel’s theorem (compare [lOOj . problem 12- 
B) and in the open case it follows, e.g., from the 
well known fact that every open 3-manifold can 
be immersed in m- Note that orientabil- 
ity is truly necessary; e.g., x is not par¬ 
allelizable. Since Cartesian products of paralleliz¬ 
able manifolds are again parallelizable, it follows 
that a 4-dimensional product spacetime (34) is also 
parallelizable. This does, of course, not generalize 
to higher dimensions. Now, for non-compact four¬ 
dimensional spacetimes it is known from |b4j that 
parallelizability is equivalent to the existence of a 
spin structure, without which spinor fields could 
not be defined on spacetime. So we see that the 
existence of spin structure is already implied by 
(341 and hence does not pose any further topolog¬ 
ical restriction. Note that the only other potential 
topological restriction at this stage is that imposed 
from the requirement that a smooth Lorentz metric 
is to exist everywhere on spacetime. This is equiv¬ 
alent to a vanishing Euler characteristic (see, e.g., 
§40 in m) which in turn is equivalently to the 
global existence of a continuous, nowhere vanish¬ 
ing vector field (possibly up to sign) on spacetime. 
But such a vector held clearly exists on any Carte¬ 
sian product with one factor being M. We conclude 
that existence of a Lorentz metric and a spin struc¬ 
ture on an orientable spacetime M = M x E pose 
no restrictions on the topology of an orientable E. 
As we will see later on, even Einstein’s equation 
poses no topological restriction on E, in the sense 
that some (physically reasonable) solutions to Ein¬ 
stein’s equations exist for any given E. Topological 
restrictions may occur, however, if we ask for solu¬ 
tion with special properties (see below). 

Now, given E, we consider a one-parameter fam¬ 
ily of embeddings 



Figure 1: Spacetime M is foliated by a one-parameter 
family of spacelike embeddings of the 3-manifold E. Here 
the image E„/ of S under lies to the future (above) and 
Eg/; to the past (below) of Eg if s" < s < s'. ‘Future’ and 
‘past’ refer to the time function t which has so far not been 
given any metric significance. 


We distinguish between the abstract 3-manifold 
E and its image Eg in M. The latter is called the 
leaf corresponding to the value s € K. Each point 
in M is contained in precisely one leaf. Hence there 
is a real valued function t : M —>■ K that assigns to 
each point in M the parameter value of the leaf it 
lies on: 

t{p) = s p G Eg. (36) 

So far this is only a foliation of spacetime by 3- 
dimensional leaves. For them to be addressed as 
“space” the metric induced on them must be posi¬ 
tive definite, that is, the leaves should be spacelike 
submanifolds. This means that the one-form dt is 
timelike: 

g~^{dt,dt) < 0. (37) 

The normalized field of one-forms is then 


to .— — . — . 

\/-9~^{dt,dt) 

As explained in section]^ we write since we think 
of this one form as the image under g of the nor¬ 
malized vector field perpendicular to the leaves: 

n'' = g{n, ■). (39) 

The linear subspace of vectors in TpM which are 
tangent to the leaf through p is denoted by tJ M ; 
hence 


Eg :=£:g(E) C M. (35) 


T^M := {X G TpM : dt{X) = 0} . (40) 
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The orthogonal complement is just the span of n at 
p, which we denote by T^M. This gives, at each 
point p of M, the p-orthogonal direct sum 

TpM = T^M® tJ M. (41) 

and associated projections (we drop reference to 

the point p) 

:TM T^M , 

X i-^eg{X,n)n, (42a) 

pll tIIm, 

X ^ X - eg{X,n)n. (42b) 


As already announced in Section]^ we introduced 
the symbol 

e = g(n,n) (43) 


in order to keep track of where the signature mat¬ 
ters. Note that the projection operators (42) are 
self-adjoint with respect to g, so that for all X,Y £ 
TM we have 


g{P^X,Y) =g{X,P^Y), (44a) 

5 (pllA,r) =p(A,plly). (44b) 


A vector is called horizontal iff it is in the kernel 
of P'*', which is equivalent to being invariant under 
pll. It is called vertical iff it is in the kernel of P^, 
which is equivalent to being invariant under P^. 

All this can be extended to forms. We define 
vertical and horizontal forms as those annihilating 
horizontal and vertical vectors, respectively: 

:= {w e t;m : uj{x) = 0, vx e pJm} , 

(45a) 

P;IIm := {w G T*M : uj{X) = 0, VX G T^M) . 

(45b) 


Using the ‘musical’ isomorphisms ([^, the self- 
adjoint projection maps (42) on vectors define self- 
adjoint projection maps on co-vectors (again drop¬ 
ping the reference to the base-point p) 


P^ :=b o P-L o tl : T*M T*^M , (46a) 

p|:=bopllojj (46b) 


For example, letting the horizontal projection of 
the form lo act on the vector X, we get 


Pila;(X) = (pllw«)^(X) 
= 5(pllw#,X) 
= g{J,p\\x) 
= a;(pllx) , 


(47) 


where we merely used the definitions ([^ of b and U 
in the second and fourth equality, respectively, and 
the self-adjointness (44b) of P^ in the third equal¬ 
ity. The analogous relation holds for P^w(X). It 
is also straightforward to check that P]' and P^ 
are self-adjoint with respect to g~^ (cf. (§). 

Having the projections defined for vectors and 
co-vectors, we can also define it for the whole ten¬ 
sor algebra of the underlying vector space, just 
by taking the appropriate tensor products of these 
maps. All tensor products between pH and p| will 
then, for simplicity, just be denoted by pH, the 
action on the tensor being obvious. Similarly for 
P-*". (For what follows we need not consider mixed 
projections.) The projections being pointwise op¬ 
erations, we can now define vertical and horizontal 
projections of arbitrary tensor fields. Hence a ten¬ 
sor field T G TT^M is called horizontal if and only 
if pllp = T. The space of horizontal tensor fields 
of rank {u,d) is denoted by FP^'^M. 

As an example, the horizontal projection of the 
metric g is 

/i:=pllg:=g(pll.,pll.) =g-en^(g)n^ (48) 


Hence h G Another example of a horizon¬ 

tal vector field is the “acceleration” of the normal 
field n: 

a := V„n . (49) 

Here V denotes the Levi-Civita covariant derivative 
with respect to g. An observer who moves perpen¬ 
dicular to the horizontal leaves has four-velocity 
u = cn and four-acceleration c^a. If L denotes the 
Lie derivative, it is easy to show that the accelera¬ 
tion 1-form satisfies 


b r b 

a = LnTl . 
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(50) 





Moreover, as n is hypersurface orthogonal it is ir- 
rotational, hence its 1 -form equivalent satisfies 

dn'’ A = 0 , (51a) 

which is equivalent to the condition of vanishing 
horizontal curl: 


Plldn^ =0. 


(51b) 


Equation (51a I can also be immediately inferred 


directly from (38). Taking the operation (ex¬ 
terior derivative followed by contraction with n) as 
well as the Lie derivative with respect to n of ( [^ 
shows 

da^ A = 0 , (52a) 

an equivalent expression being again the vanishing 
of the horizontal curl of a: 


= 0 . 


(52b) 


This will be useful later on. 

Note that a*' is a horizontal co-vector field, i.e. an 
element of M. More generally, for a purely 

covariant horizontal tensor field we have the follow¬ 
ing results, which will also be useful later on: Let 
T e rr"°M, then 


pllL„T = L„T, (53a) 

= /L„T, (53b) 


for all / G C°°{M). Note that (53a) states that the 
Lie derivative in normal direction of a horizontal 
covariant tensor field is again horizontal. That this 
is not entirely evident follows, e.g., from the fact 
that a corresponding result does not hold for T G 
where u > 0. The proofs of (53) just use 


standard manipulations. 

A fixed space-point g G E defines the worldline 
(history of that point) K 9 s >-)■ £s{q)- The col¬ 
lection of all worldlines of all space-points define a 
foliation of M into one-dimensional timelike leafs. 
Each leaf is now labeled uniquely by a space point. 
We can think of “space”, i.e., the abstract mani¬ 
fold E, as the quotient M/^, where p ~ p' iff both 
points lie on the same worldline. As any E^ in¬ 
tersects each worldline exactly once, each E^ is a 


representative of space. Instead of using the folia¬ 
tion by 3-dimensional spatial leaves (35) we could 
have started with a foliation by timelike lines, plus 
the condition that these lines are vorticity free. 
These two concepts are equivalent. Depending on 
the context, one might prefer to emphasize one or 
the other. 

The vector parallel to the worldline at p = Ssio) 
is, as usual in differential geometry, defined by its 
action on / G C°°{M) (smooth, real valued func¬ 
tions): 


dt 


£s(q) 


/ = 


df{£s'{q)) 


ds' 


(54) 


At each point this vector field can be decomposed 
into its horizontal component that is tangential to 
the leaves of the given foliation and its normal com¬ 
ponent. We write 


1_9 
c dt 


an + d, 


(55) 


where f3 is the tangential part; see Figurej^ The 



Figure 2: For fixed q £ T, its image points p = Ss{q) 
and p' = ^s+dsiQ) for infinitesimal ds are connected by the 
vector d/dt\p, whose components normal to Ss are a (one 
function, called lapse) and /3 (three functions, called shift) 
respectively. 

real-valued function a is called the lapse (function) 
and the horizontal vector field /3 is called the shift 
(vector-field) . 

4.1 Decomposition of the metric 

Let { 60 , 61 , 62 , 63 } be a locally defined orthonor¬ 
mal frame with dual frame (0°, 0^, 0^, 0^}. We call 
them adapted to the foliation if 6 o = n and 0 ° = n*'. 
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A local coordinate system {x^, , x^} is called 

adapted if are horizontal for a = 1,2,3. Note 

that in the latter case d/dx^ is not required to be 
orthogonal to the leaves (i.e. it need not be par¬ 
allel to n). For example, we may take x^ to be 
proportional to t; say x^ = ct. 

In the orthonormal co-frame the spacetime met¬ 
ric, i.e. the field of signature (e, -b, -b, -b) metrics 
in the tangent spaces, has the simple form 

3 

5 = e6»° 0 -b XI ® • (56) 

a—1 

The inverse spacetime metric, i.e. the field of signa¬ 
ture (e, -b, -b, -b) metrics in the co-tangent spaces, 
has the form 

3 

=eeo(g)eo-bX®“®®“' (5^) 

a—1 


The relation that expresses the coordinate basis 
in terms of the orthonormal basis is of the form (in 
a self-explanatory matrix notation) 


f d/dx^ \ _ (a l5°- \ (eoX 
\d/dx^) 1^0 A-J 


(58) 


where /3“ are the components of /3 with respect to 
the horizontal frame basis {ca}. The inverse of (581 
is 


Orthogonality of the Ca implies for the chart 
components of the spatial metric (481 


hmu := h{d/dx^, d/dx'^) = X , (62) 


and its inverse 


Z,™" := h-\dx^,dx-) = (63) 


Inserting (60) into (56) and using (62) leads to 
the (3-bl)-form of the metric in adapted coordi¬ 
nates 


g = + h{fj, dt® dt 

-b cPm {dt (g) dx^ + dx^ (g dt) (64) 
+ hmn dx'^ (g dx^ , 


where Pm '■= hmnP'^ are the components of /3^ := 
g{P, ■) = h{P, ■) with respect to the coordinate ba¬ 
sis {d/dx'^}. Likewise, inserting (61) into (57) and 
using (63) leads to the (3-bl)-form of the inverse 
metric in adapted coordinates (we write dt := d/dt 
and dm ■= djdx^ for convenience) 


g ^ = ec ^ dt® dt 

- ec~^a~‘^ P"^{dt ^dm + dm® dt) (65) 
+ {h"^^+eP"^P^)dm®dn. 



[A- 


/ d/dx^ 

yd/dx” 


(59) 


Finally we note that the volume form on space- 
time also easily follows from (60) 


where /J™ are the components of /3 with respect 
to the horizontal coordinate-induced frame basis 


{d/dx'^}. 

The relation for the co-bases dual to those in (58) 
is given by the transposed of (58), which we write 
as: 


dgg = 9° A 9^ A 0^ A 9^ 

= a\/det{hmn} cdt A d^x , 


( 66 ) 


where we use the standard shorthand d^x = dx^ A 
dx"^ A dx^. 


(00 0 “) = (dx° dx^) 


a /3“ 
0 A“ 


(60) 


4.2 Decomposition of the 
covariant derivative 


The inverse of that is the transposed of (59): 


{dx^ dx'^) = {9° 



J 


( 61 ) 


Given horizontal vector fields X and Y, the covari¬ 
ant derivative of Y with respect to X need not be 
horizontal. Its decomposition is written as 

XxY ^ DxY + nKiX,Y), (67) 
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where 


DxY (68) 
K{X,Y):=eg{n,WxY). (69) 

The map D defines a covariant derivative (in the 
sense of Kozul; compare |110j . Vol2) for horizon¬ 
tal vector fields, as a trivial check of the axioms 
reveals. Moreover, since the commutator [X, Y] 
of two horizontal vector fields is always horizontal 
(since the horizontal distribution is integrable by 
construction), we have 

T^{X, Y) = DxY - DyX - [X, Y] 

= P\\{\/xY-VyX-[X,Y]) (70) 

= 0 


due to V being torsion free. We recall that tor¬ 
sion is a tensor field T G FTjM associated to each 
covariant derivative V via 


T^(X,Y) =VxY-VrX-[X,Y]. (71) 


We have T(X,Y) = —T(Y,X). As usual, even 
though the operations on the right hand side of 
( [7T| ) involve tensor fields (we need to differentiate), 
the result of the operation just depends on X and Y 
pointwise. This one proves by simply checking the 
validity of T(fX,Y) = fT{X,Y) for all smooth 
functions /. Hence (70) shows that D is torsion 


free because V is torsion free. 

Finally, we can uniquely extend D to all hori¬ 
zontal tensor fields by requiring the Leibniz rule. 
Then, for X, Y, Z horizontal 


{Dxh){Y,Z) 

= X{h(Y, Z)) - h(DxY, Z) - h(Y, DxZ) 
= X(g{Y,Z)) - g{VxY,Z) - g{Y,VxZ) 
= {Vxg)(Y,Z)=Q 


due to the metricity, Xg = 0, of V. Hence D is 
metric in the sense 


D/i = 0 . (73) 

The map K from pairs of horizontal vector fields 
{X,Y) into functions define a symmetric tensor 


field. Symmetry follows from the vanishing torsion 
of V, since then 


K(X,Y) = £g{n,XxY) 

= eg(n,VyX + [X,y]) 
= ^gin.XyX) 

= K{Y,X) 


(74) 


for horizontal X^Y. From (69) one sees that 
K(fX,Y) = fK(X,Y) for any smooth function 
/. Hence K defines a unique symmetric tensor 
field on M by stipulating that it be horizontal, i.e. 
K(ny) = 0. It is called the extrinsic curvature of 
the foliation or second fundamental form, the first 
fundamental form being the metric. From (69) 
and the symmetry just shown one immediately in¬ 
fers the alternative expressions 


K(X,Y) = -eg(Xxn,Y) = -eg(Xyn,X). 

(75) 

This shows the relation between the extrinsic cur¬ 
vature and the Weingarten map, Wein, also called 
the shape operator, which sends horizontal vectors 
to horizontal vectors according to 

X Wein(X) := Xxn . (76) 


Horizontality oiXxn immediately follows from n 
being normalized: g(n,Xxn) = \X{g{n,n)') = 0 . 
Hence (75) simply becomes 


K(X,Y) = -eh{Wein(X),Y) 
= -e h{X,Wein{Y)) , 


(77) 


where we replaced g with h —defined in (48)— 
since both entries are horizontal. It says that K 
is (—e) times the covariant tensor corresponding 
to the Weingarten map, and that the symmetry 
of K is equivalent to the self-adjointness of the 
Weingarten map with respect to h. The Wein¬ 
garten map characterizes the bending of the em¬ 
bedded hypersurface in the ambient space by an¬ 
swering the following question: In what direction 
and by what amount does the normal to the hy¬ 
persurface tilt if, starting at point p, you progress 
within the hypersurface by the vector X. The an¬ 
swer is just Weinp(X). Self adjointness of Wein 
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then means that there always exist three (n — 1 
in general) perpendicular directions in the hyper¬ 
surface along which the normal tilts in the same 
direction. These are the principal curvature direc¬ 
tions mentioned above. The principal curvatures 
are the corresponding eigenvalues of Wein. 

Finally we note that the covariant derivative of 
the normal field n can be written in terms of the 
acceleration and the Weingarten map as follows 

Vn = en*’ 0 a + Wein . (78) 


Recalling the purely covariant version of this 
is 


=-e{K-n^ ®a^) . (79) 


From (481 and (79) we derive by standard manip¬ 
ulation, using vanishing torsion, 


Lnh = -2eK . (80) 


In presence of torsion there would be an addi¬ 
tional term -\-2{inT)\, where the subscript s de¬ 
notes symmetrization; in coordinates = 


5 Curvature tensors 


assume a minimal and a maximal value, denoted 

by kmin{p) = k{p,Vmin) and k^axip) = k(jJ,Vmax) 

respectively. These are called the principal curva¬ 
tures of S' at p and their reciprocals are called the 
principal radii. It is clear that the principal direc¬ 
tions Vmin and Umax just Span the eigenspaces of 
the Weingarten map discussed above. In particu¬ 
lar, Vmin and Umax are orthogonal. The Gaussian 
curvature K(p) of S at p is then defined to be the 
product of the principal curvatures: 

K{p) = kminip) ■ kmaxip) ■ (81) 

This definition is extrinsic in the sense that essen¬ 
tial use is made of the ambient in which S is em¬ 
bedded. However, Gauss’ theorema egregium states 
that this notion of curvature can also be defined in¬ 
trinsically, in the sense that the value K (p) can be 
obtained from geometric operations entirely car¬ 
ried out within the surface S. More precisely, it is 
a function of the first fundamental form (the met¬ 
ric) only, which encodes the intrinsic geometry of 
S', and does not involve the second fundamental 
form (the extrinsic curvature), which encodes how 
S is embedded into 

Let us briefly state Gauss’ theorem in mathemat¬ 
ical terms. Let 


We wish to calculate the (intrinsic) curvature ten¬ 
sor of V and express it in terms of the curvature 
tensor of D, the extrinsic curvature K, and the 
spatial and normal derivatives of n and K. Be¬ 
fore we do this, we wish to say a few words on the 
definition of the curvature measures in general. 

All notions of curvature eventually reduce to that 
of curves. For a surface S embedded in we 
have the notion of Gaussian curvature which comes 
about as follows: Gonsider a point p G S and a unit 
vector V at p tangent to S. Consider all smooth 
curves passing through p with unit tangent v. It 
is easy to see that the curvatures at p of all such 
curves is not bounded from above (due to the pos¬ 
sibility to bend within the surface), but there will 
be a lower bound, k{p,v), which just depends on 
the chosen point p and the tangent direction repre¬ 
sented by V. Now consider k(p,v) as function of v. 
As V varies over all tangent directions k{p, v) will 


g = gab dx°' G dx^ (82) 

be the metric of the surface in some coordinates, 
and 

^ab ~ {~9dgab + dagbd + dhgda) i (83) 

certain combinations of first derivatives of the met¬ 
ric coefficients, known under the name of Christof- 
fel symbols . Note that F^f, has as many indepen¬ 
dent components as daPbc and that we can calculate 
the latter from the former via 

dcPab = 9an^bc + 9bn^ac ■ (^ 4 ) 

Next we form even more complicated combinations 
of first and second derivatives of the metric coeffi¬ 
cients, namely 

R\cd = dc^b - ddT:, + - r^F^,, (85) 
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which are now known as components of the Rie- 
mann curvature tensor. From them we form the 
totally covariant (all indices down) components: 

Rah cd — 9anR bed ' 


They are antisymmetric in the first and second 
index pair. Rated — Rbacd — Rab dei SO that 
i?i 2 i 2 is the only independent component. Gauss’ 
theorem now states that at each point on S we have 


K = 


Ri2 ] 


911922 - 9i2 


(87) 


An important part of the theorem is to show that 


the right-hand side of (871 actually makes good 


geometric sense, i.e. that it is independent of the 
coordinate system that we use to express the coef¬ 
ficients. This is easy to check once one knows that 
Rated are the coefficients of a tensor with the sym¬ 
metries just stated. In this way the curvature of 
a surface, which was primarily defined in terms of 
curvatures of certain curves on the surface, can be 
understood intrinsically. In what follows we will see 
that the various measures of intrinsic curvatures of 
n-dimensional manifolds can be reduced to that of 
2 -dimensional submanifolds, which will be called 
sectional curvatures. 

Back to the general setting, we start from the 
notion of a covariant derivative V. Its associated 
curvature tensor is defined by 

Rix, Y)Z = (VxVy - Vy Vx - V[x,y])^ . (88) 

For each point p G M it should be thought of 
as a map that assigns to each pair X,Y G TpM 
of tangent vectors at p a linear map R{X,Y) : 
TpM —> TpM. This assignment is antisymmetric, 
i.e. R{X,Y) = -R{Y,X). It R{X,Y) is applied 
to Z the result is given by the right-hand side of 


(88). Despite first appearance, the right-hand side 
of (88) at a point p G M only depends on the 


values of X, Y, and Z at that point and hence de¬ 
fines a tensor field. This one again proves by show¬ 
ing the validity of R{fX,Y)Z = R{X,fY)Z — 
R{X,Y)fZ = fR{X,Y)Z for all smooth real¬ 
valued functions / on M. In other words: All terms 
involving derivatives of / cancel. 


From (88) and using 0 one may show that the 
Riemann tensor always obeys the first and second 
Bianchi identities: 


E RiX,Y)Z 

(XYZ) 

= E {(VxT)(r,z)-r(A,T(r,z))}, 

(XYZ) 

(89a) 

E (Vxi?)(D,Z) 

(XYZ) 

= E R{X,T{Y,Z)), (89b) 

(XYZ) 

where the sums are over the three cyclic permu¬ 
tations of X, Y, and Z. For zero torsion these 
identities read in component form: 


E = 

(90a) 

(Xv-v) 


E VAi?“^^,=0. 

(90b) 

(A/ji/) 



The second traced on (a, p) and contracted with 
yields (—2) times (13). 

The covariant Riemann tensor is defined by 

Riem(lF, Z, A, Y) := g{W, R{X, Y)Z) . (91) 


For general covariant derivatives its only symme¬ 
try is the antisymmetry in the last pair. But for 
special choices it acquires more. In standard GR 
we assume the covariant derivative to be metric 
compatible and torsion free: 


V5 = 0, (92) 

T = 0. (93) 


In that case the Riemann tensor has the symme¬ 
tries 


Riem(IF, Z, A, Y) = -Riem(lF, Z, Y, X) , (94a) 
Riem(IF, Z, A, Y) = -Riem(Z, W, A, Y) , (94b) 
Riem(W, A, Y, Z) + Riem(lF, Y, Z, A) -b 
Riem(W, Z, A, A) = 0, (94c) 

Riem(lF, Z, A, Y) = Riem(A, Y, W, Z) . (94d) 
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Equation (94a) is true by definition ( 88 ), (94b) is 


equivalent to metricity of V, and (94c) is the first 
Bianchi identity in case of zero torsion. The last 


symmetry (94d) is a consequence of the preceding 


three. Together ( |94a[ ), ( |94b[ ), and ( |94d[ ) say that, 
at each point p G M, Riem can be thought of 
as symmetric bilinear form on the antisymmetric 
tensor product TpM A TpM. The latter has di¬ 
mension N = |n(n — 1) if M has dimension n, 
and the space of symmetric bilinear forms has di¬ 
mension ^N{N + 1). From that number we have 
to subtract the number of independent conditions 


(94c), which is (") in dimensions n > 4 and zero 
otherwise. Indeed, it is easy to see that (94c) is 


identically satisfied as a consequence of (94a) and 


(94b) if any two vectors W, Z, X, Y coincide (pro¬ 


portionality is sufficient). Hence the number ^ of 
independent components of the curvature tensor is 

#Riem = 


for 

n > 4 

for 

n = 3 

for 

n = 2 


= — 1 ) for all n > 2 . 


(95) 


The Ricci and scalar curvatures are obtained 
by taking traces with respect to g: Let {ei, • • • , e„} 
be an orthonormal basis, g{ea, eb) = SabSa (no sum¬ 
mation) with Sa = ± 1 , then 

n 

Ric(X,y) = ^£aRiem(ea,^,ea,E) (96) 

a—1 
n 

Seal = £a Ric(ea, ea ) • (97) 

a—1 

The Einstein tensor is 

Ein = Ric — iScalg . (98) 

The sectional curvature is defined by 
Riem(X, F, X, Y) 


Sec(X, Y) = 


g{X,X)g{Y,Y)-[g{X,Y)\ 


(99) 


Here X, F is a pair of linearly independent tangent 
vectors that span a 2 -dimensional tangent subspace 
restricted to which g is non-degenerate. We will 
say that spanjX, F} is non-degenerate. This is the 
necessary and sufficient condition for the denomi¬ 
nator on the right-hand side to be non zero. The 
quantity Sec(X, F) is called the sectional curva¬ 
ture of the manifold {M, g) at point p tangent to 
span{X, F}. From the symmetries of Riem it is 
easy to see that the right-hand side of (99) does 


indeed only depend on the span of X, Y. That is, 
for any other pair X', Y' such that spanj^', F'} = 
span{X, F}, we have Sec(X',F') = Sec(X, F). 
The geometric interpretation of Sec)^, F) is as 
follows: Consider all geodesics of {M,g) that pass 
through the considered point p G M in a, direc¬ 
tion tangential to spanjX, F}. In a neighborhood 
of p they form an embedded 2-surface in M whose 
Gaussian curvature is just Sec(X, F). 

Now, Riem is determined by components of the 
form Riem(X, F, X, F), as follows from the fact 
that Riem is a symmetric bilinear form on TM A 
TM. This remains true if we restrict to those X, Y 
whose span is non-degenerate, since they lie dense 
in TM A TM and (X, F) Riem(X, F, X, F) is 
continuous. This shows that the full information 
of the Riemann tensor can be reduced to certain 
Gaussian curvatures. 

This also provides a simple geometric interpre¬ 
tation of the scalar and Einstein curvatures in 
terms of sectional curvatures. Let {Xi,--- ,X„} 
be any set of pairwise orthogonal non-null vec¬ 
tors. The |n(n — 1) 2-planes span{Xa,Xh} are 
non-degenerate and also pairwise orthogonal. It 


then follows from (97) and (99) that the scalar cur¬ 


vature is twice the sum of all sectional curvatures: 


Seal = 2 Sec{Xa,Xb). 


( 100 ) 


a,b = l 
a<b 


The sum on the right-hand side of (100) is the same 
for any set of ^n{n — 1 ) non-degenerate and pair¬ 
wise orthogonal 2-planes. Hence the scalar curva¬ 
ture can be said to be twice the sum of mutually or¬ 
thogonal sectional curvatures, or n{n— 1 ) times the 
mean sectional curvature. Similarly for the Ricci 
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and Einstein curvatures. The symmetry of the 
Ricci and Einstein tensors imply that they are fully 
determined by their components Ric(VT, W) and 
Ein(lE, W). Again this remains true if we restrict 
to the dense set of non-null W, i.e. g{W, W) ^ 0. 
Let now {Xi,--- ,X„_i} be any set of mutually 
orthogonal vectors (again they need not be nor¬ 
malized) in the orthogonal complement of W. As 
before the |(n — l)(n — 2 ) planes spanjAla,X(,} 
are non degenerate and pairwise orthogonal. From 
(96), (98), and (99) it follows that 


their Kulkarni-Nomizu product is defined by 

fc®£(Xl,X2,X3,X4) :=fc(Xi,X3)^(X2,X4) 

+ k{X2,X4)e{Xi,X3) 
-k{Xi,X4)e{X2,X3) 
-k{X2,X3)e{Xi,X4), 
(103) 

or in components 

{k(Q)^^abcd — kac^bd~\~khd^ac klyc^ad ■ 


n—1 

Ric(lT, W) = g{W, W) ^ Sec(lE, Xa) (101) 

a—1 


and 


n—1 

Ein(lE,lE) = -giW,W) ^ Sec(X„Xb). 

(102) 

Again the right-hand sides will be the same for any 
set {All, • • • , Al„_i} of n — 1 mutually orthogonal 
vectors in the orthogonal complement of W. Note 
that Ric(lE, W) involves all sectional curvatures 
involving W whereas Ein(lT, W) involves all sec¬ 
tional curvatures orthogonal to W. For normalized 
W, where g{W, W) = a = ±1, we can say that 
—crG'(lF, W) is the sum of sectional curvatures or¬ 
thogonal to W, or i(n—l)(n —2) times their mean. 
Note that for timelike W we have tr = — 1 and 
G{W, W) is just the sum of spatial sectional curva¬ 
tures. 


Finally we mention the Weyl curvature tensor, 
which contains that part of the information in the 
curvature tensor not captured by the Ricci (or 
Einstein-) tensor. To state its form in a compact 
form, we introduce the Kulkarni-Nomizu product^ 
denoted by an encircled wedge, ®, which is a bi¬ 
linear symmetric product on the space of covari¬ 
ant symmetric rank-two tensors with values in the 
covariant rank-four tensors that have the symme¬ 
tries (94) of the Riemann tensor. Let k and i be 
two symmetric covariant second-rank tensors, then 


The Weyl tensor, Weyl, is of the same type as 
Riem but in addition totally trace-free. It is ob¬ 
tained from Riem by a projection map, Pwy given 

by 

Weyl := P 4 v(Riem) 

:= Riem - (Ric - ® g . 

(105) 


Piv is a linear map from the space of rank-four 
tensors with Riemann symmetries to itself. It is 
easy to check that its image is given by the totally 
trace-free such tensors and that the kernel consists 
of all tensors of the form g® AT, where AT is a sym¬ 
metric rank-two tensor. The latter clearly implies 
PwoPw = Pw- The dimension of the image corre¬ 
sponds to the number of independent components 
of the Weyl tensor, which is given by (95) minus 
the dimension 
for n > 3 


2 -n(n - 


1) of the kernel. This gives 


#Weyl = rPn{n 1) [n{n — 1) — 6 ] (106) 


and zero for n = 2. Note that in n = 3 dimensions 


the Weyl tensor also always vanishes, so that (105) 


can be used to express the Riemann tensor in terms 
of the Ricci and scalar curvature 


Riem = (Ric—jScalg) ®g (for n = 3). (107) 

A metric manifold (M, g) is said to be of constant 
curvature if 

Riem = fcg®g (108) 
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for some function k. Then Ric = 2k{n — l)g and 
Ein = —k{n — l)(n — 2)g. We recall that mani¬ 
folds (M, g) for which the Einstein tensor (equiv¬ 
alently, the Ricci tensor) is pointwise proportional 
to the metric are called Einstein spaces. The twice 


a different meaning from that given to it in (48). 


contracted second Bianchi identity (13) shows that 
k must be a constant unless n = 2. For n = 3 


equation (107) shows that Einstein spaces are of 


constant curvature. 

5.1 Comparing curvature tensors 

Sometimes one wants to compare two different cur¬ 
vature tensors belonging to two different covariant 
derivatives V and V. In what follows, all quan¬ 
tities referring to V carry a hat. Recall that a 
covariant derivative can be considered as a map 
V : TT^M X TT^M TT^M, {X,Y) ^ XxY, 
which is C°°(M)-linear in the first and a deriva¬ 
tion in the second argument. That is, for / G 
C~(M) have Vfx+yZ = fVxZ + VyZ and 
VxifY + Z) = Xif)Y + fVxY + VxZ. This 
implies that the difference of two covariant deriva¬ 
tives is C°°{M)- linear also in the second argument 
and hence a tensor field: 


V - V =: A G TTiM . 


(109) 


Replacing V with V -f A in the definition of the 
curvature tensor for V according to (88) directly 
leads to 

R{X,Y)Z = R{X,Y)Z 

+ iVxA){Y,Z)-{VYA){X,Z) 

+ A{X,AiY,Z))-A{Y,AiX,Z)) 

+ A{T{X,Y),Z)). 

( 110 ) 

Note that so far no assumptions have been made 
concerning torsion or metricity of V and V. This 
formula is generally valid. In the special case where 
V and V are the Levi-Civita covariant derivatives 
with respect to two metrics g and g, we set 


h:=g-g, 


( 111 ) 


which is a symmetric covariant tensor field. Note 
that here, and for the rest of this subsection, h has 


We recall that the Levi-Civita covariant derivative 
is uniquely determined by the metric. For V this 
reads 

2g{VxY,Z) 

= X{g{Y, Z)) + Y{g{Z, X)) - Z(g(X, Y)) 

- g(X, [Y, Z])]) + g{Y, [Z, A])]) + g{Z, [X, F])]). 

( 112 ) 


Subtracting (112) from the corresponding formula 


with V and g replaced by V and g yields, using 
T = 0, 

2g{A(X,Y),Z) = 

- {yzh){X, Y) + {Xxh)iY, Z) + (Vyh)(Z, A). 

(113) 

This formula expresses A as functional of g and g. 
There are various equivalent forms of it. We have 
chosen a representation that somehow minimizes 
the appearance of g. Note that g enters in h as 
well as V, whereas g enters in h and via the scalar 
product on the left-hand side. The latter obstructs 
expressing A as functional of g and h alone. In 
components ( |113| ) reads 

■ (114) 

Note that one could replace the components of h 
with those oi g = g -\-hm. the bracket on the right- 
hand side, since the covariant derivatives of g van¬ 
ish. 

Now suppose we consider h and its first and sec¬ 
ond derivatives to be small and we wanted to know 
the difference in the covariant derivatives and cur¬ 
vature only to leading (linear) order in h. To that 
order we may replace g with g on the left-hand side 


of (1131 and the right-hand side of (1141. Moreover 


we may neglect the A-squared terms in (110) and 


obtain, writing SR for the first order contribution 
to R — R, 

5R\,^ = V,A^, - VdA“,. (115) 

From this the first-order variation of the Ricci ten- 
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sor follows, writing hab =■ Sgab, 

SRab = ^uKb - 

= \{-^g5gab-VaVb5g (116) 

+ V"V,5g„6 + V"Vf,5g„,), 

where Ag := g^'^Va'^b and Sg = g°-^ Sgab- Finally, 
the variation of the scalar curvature is (note 6g°'^ = 
-g^^g^‘^6g,d = -h^^) 

dR = RabSg^^ + VaU\ 


where 


jja ^ _ an 

^ y ‘—^nm y ‘-^Ti 


= G‘^’’‘^^WbSg,d. 


(117a) 


(117b) 


Here we made use of the De Witt metric, which de¬ 
fines a symmetric non-degenerate bilinear form on 
the space of symmetric covariant rank-two tensors 
and which in components reads: 


G' 


ibcd 


= h - 25 “ V'") • (118) 


We will later have to say more about it. 

We also wish to state a useful formula that com¬ 
pares the curvature tensors for conformally related 
metrics, i.e. 

9 = e^*g, (119) 

where ^ : M —>■ M is smooth. Then 


Rienig = 


Riem„ 


5' 


)K 


(120a) 


with 


K ■=—V^<j) + d<j) ® d4i — \g ^ (dcj), dfjj) g . (120b) 

(This can be proven by straightforward calculations 
using either (88) and (112), or Cartan’s structure 


equations, or, most conveniently, normal coordi¬ 


nates.) From (120a I and the fact that the kernel 


of the map Pw in (105) is given by tensors of the 


form g ® K it follows immediately that 
Weylg = e^^Weylg . 


( 121 ) 


This is equivalently expressed by the conformal in¬ 
variance of the contravariant version of the Weyl 


tensor, which is related to the covariant form, 


Weyl, in the same way (911 as the curvature ten¬ 


sor R is related to Riem (i.e., by raising the first 
index of the latter). 


From (120) we also deduce the transformation 


properties of the Ricci tensor: 


RiCg =RiCg 


- {Ag(j) + {n - 2)g ^{d4>,d(t))) g 

— (n — 2)(VV()) — d(p® d(j)). 


( 122 ) 


where, as above, Ag denotes again the Lapla- 
cian/d’Alembertian for g. Finally, for the scalar 
curvature we get 

Scalg = ^Scalg 

— 2{n — l)Ag()) 

- (n - l)(n - 2)g~^{d(j), d<j))^ . 

(123) 

This law has a linear dependence on the second and 
a quadratic dependence on first derivatives of (j). If 
the conformal factor is written as an appropriate 
power of some positive function il : M ^ K+ we 
can eliminate all dependence on first and just retain 
the second derivatives. In n > 2 dimensions it is 
easy to check that the rule is this: 


52 ^ = , 


then (123) becomes 


Scalg = — 


4(n-l)^_^ 


n — 2 


-^VgU, 


where 


f) _ 2 

Vg = Ag - — -^ Scalg 

^ " 4(n - 1) ^ 


(124) 


(125a) 


(125b) 


Vg is a linear differential operator which is ellip¬ 
tic for Riemannian and hyperbolic for Lorentzian 


metrics g. If we set fl = H 1 H 2 and apply (125) 


twice, one time to the pair {g,g), the other time 
to {g,Q. 2 g), we obtain by direct comparison (and 
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renaming 172 to thereafter) the conformal trans¬ 
formation property for the operator Vg-. 

=M(n-^^oVgoM{n), (126) 

where M(il) is the linear operator of multiplica¬ 
tion with n. This is the reason why T>g is called 
the conformally covariant Laplacian (for Rieman- 
nian g) or the conformally covariant wave operator 
(for Lorentzian g). As we will see, it has useful 
applications to the initial-data problem in GR. 


5.2 Curvature decomposition 


Using (671 we can decompose the various curvature 
tensors. First we let X,Y,Z be horizontal vector 
fields. We use (67) in (88) and get the general 
formula (i.e. not yet making use of the fact that V 
and D are metric and torsion free) 


R{X,Y)Z = R^{X,Y)Z 

+ (Vxn) K{Y, Z) - (Vyn) K{X, Z) 

+ n[{DxK){Y, Z) - {DyK){X, Z)] 

+ nK{T^{X,Y),Z), 

(127) 


where 


R^{X,Y,)Z:= {DxDy-DyDx-D^x,y])Z 

(128) 

is the horizontal curvature tensor associated to the 
Levi-Civita covariant derivative D of h. This for¬ 
mula is general in the sense that it is valid for any 
covariant derivative. No assumptions have been 
made so far concerning metricity or torsion, and 
this is why the torsion of D (defined in @) 
makes an explicit appearance. From now on we 
shall restrict to vanishing torsion. We observe that 
the first two lines on the right-hand side of (127) 
are horizontal whereas the last two lines are pro¬ 
portional to n. Decomposition into horizontal and 
normal components, respectively, leads to (where 
= 0 and X, Y, Z, and W are horizontal). 


Riem(W, Y, A, Y) = Riem^(W, Y, A, Y) 

- e [K(W, X)K{Z, Y) - K{W, Y)K{Z, A)] . 

(129) 


Here we used h{W, Vjcn) 
and 


-eK{W,X) from (75), 


Riem(n, Y, A, Y) 

= s[{DxK){Y,Z)-{DYK)iX,Z)]. 


Here and in the sequel we return to the meaning of 
h given by (48). In differential geometry (129) is 
referred to as Gauss equation and (130) as Codazzi- 
Mainardi equation. 

The remaining curvature components are those 
involving two entries in n direction. Using (79) 
we obtain via standard manipulations (now using 
metricity and vanishing torsion) 


Riem(A, n, Y, n) 

= jjs:(VvV„ — V„Vv — ^[Y,n])n^ 

= ixivi^LriK + K o K + Daf — eaf 0 of) . 

(131) 


Here K o K {X,Y) := h-\ixK,iYK) = 

ixK(^{iYKY) and we used the following relation 
between covariant and Lie derivative (which will 
have additional terms in case of non-vanishing tor¬ 
sion): 

XnK ^ LnK + 2sK o K . (132) 


Note also that the left-hand side of (131) is sym¬ 
metric as consequence of (94d). On the right-hand 
side only Da^ is not immediately seen to be sym¬ 
metric, but that follows from (52b). Unlike (129) 
and (130), equation (131) does not seem to have a 
standard name in differential geometry. 

Equations (127), (129), and (130) express all 
components of the spacetime curvature in terms of 
horizontal quantities and their Lie derivatives L„ 
in normal direction. According to (55) the latter 
can be replaced by a combination of Lie derivatives 
along the time vector-field d/dt and the shift (3. 
From (53b) we infer that Lq,„ = on horizontal 
covariant tensor fields, therefore we may replace 


An ^ ^ - l]^) (133) 


on horizontal covariant tensor fields. Here we set 
LII = pll oL, i.e. Lie derivative (as operation in the 


21 






















ambient spacetime) followed by horizontal projec¬ 
tion. Moreover, using (50), one easily sees that the 


acceleration 1 -form can be expressed in terms of 
the spatial derivative of the lapse function: 


a}’ = —ea ^ Da . 


(134) 


Hence the combination of accelerations appearing 


in (130) may be written as 


Da^ — ea^ 0 a'’ = —ea ^D^a . 


(135) 


Note that D^a := DDa is just the horizontal co¬ 
variant Hessian of a with respect to h. 


6 Decomposing 
equations 


Einstein’s 


The curvature decomposition of the previous sec¬ 
tion can now be used to decompose Einstein’s equa¬ 
tions. For this we decompose the Einstein ten¬ 
sor Ein into the normal-normal, normal-tangential, 
and tangential-tangential parts. Let {cq, ei, 62 , 63 } 
be an orthonormal frame with eg = n, i.e. adapted 


to the foliation as in Section 4.1 Then (102) to¬ 


gether with ( |129[ ) immediately lead to 

2 Ein(eo, eg) = - [K^bK^^ - {Kf] - eScal^ , 

where Scal^ is the scalar curvature of D, i.e. of 
the spacelike leaves in the metric h. Similarly we 


obtain from (130) 


Ein(eg,ea) = Ric(eg,ea) = -e[D^Kab - DaK^] . 

(137) 

The normal-normal component of the Ricci ten¬ 
sor cannot likewise be expressed simply in terms of 
horizontal quantities, the geometric reason being 
that, unlike the Einstein tensor, it involves non¬ 


horizontal sectional curvatures (compare ( 101 ) and 
( 102[ )). A useful expression follows from taking the 
trace of (131), considered as symmetric bilinear 


form in X and Y. The result is : 


Ric(eg, eg) = -KabK^^^ + {KY + eV • E, (138) 


where V • denotes the divergence with respect to V 
and E is a vector field on M whose normal com¬ 
ponent is the trace of the extrinsic curvature and 
whose horizontal component is e times the acceler¬ 
ation on n: 

V = nKl + ea . (139) 

For the horizontal-horizontal components of Ein¬ 
stein’s equation it turns out to be simpler to use 


their alternative form ( 6 b) with the Ricci tensor on 


the left hand side. For that we need the horizontal 
components of the Ricci tensor, which we easily get 


from (129) and (131): 


Ric(ea,eb) = Ric^(ea,eb) 

+ L^Kab + 2eKacK^ - eK^bK 
4 “ eDdQb aaab • 

(140) 

For later applications we also note the expression 
for the scalar curvature. It follows, e.g., from 


adding the horizontal trace of (140) to e times 


(138). This leads to 

Seal = Scal^ - £ [KabK‘^^ - (KY] + 2V • E . 

(141) 

Here we made use of the relation between the V 
and D derivative for the acceleration 1-form: 

Va^ = Da'’ +en^(^ VnY + laK ® n '’, (142) 

whose trace gives the following relation between 
the V and D divergences of a: 


X ■ a = D ■ a — eh(a, a). 


(143) 


Another possibility would have been to use (136) 


and (138) in Seal = —2£(Ein(eg, eg) —Ric(eg, eg)). 

Using (136) and (137), and also using the 
DeWitt metric (118) for notational ease, we can 


immediately write down the normal-normal and 
normal-tangential components of Einstein’s equa¬ 
tions 

G‘’^‘’‘^KabK^d + eScal° = -2k T(n, n), (144a) 

G^^bcdDbK^^ = -eK/i“''T(n, e^). (144b) 
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From and ( |118| ) we notice that the bilinear 
form on the left-hand side of (144a) can be written 
as 


G{K,K): = G‘^^‘^^KabK,d 

= Tr(Wein o Wein) — (Tr(Wein))^ . 

(145) 


Here the trace is natural (needs no metric for its 
dehnition) since Wein is an endomorphism. In a 
local frame in which Wein is diagonal with entries 
k := (fci, k 2 , fcs) we have 


G{K, K) := (<5“'’ - 3n‘^n^)kakb , (146) 


in 


Kab := 

cat 

= {LlK)^^ + D^Dba 

+ a[-2eKacK^ + sKabK^ - Ric^(ea, e^)] 

- a£:^habT^{n,n) 

+ aK{T - ;j^Trft,(T) h) (co, e&). 

(148) 


Note that in the last term the trace of T is taken 
with respect to h and not g. The relation is 
Tr;,(T) =Trg(T)-eT(n,n). 

The only remaining equation that needs to be 
added here is that which relates the time derivative 
of h with K. This we get from (80) and (133): 


where n“ are the components of the normalized 
vector (l,l,l)/v^ in eigenvalue-space, which we 
identify with endowed with the standard Eu¬ 
clidean inner product. Hence, denoting by 9 the 
angle between n and k, we have 


G{K,K) 


0 


> 0 
< 0 


if I cos0| = \/l/3 

if I cos0| < \/l/3 (147) 

if I cos0| > 1 / 173 . 


hab ■■= = {L'U)ab - ■ (149) 


Equations (1491 and (148) are six first-order in 


time evolution equations for the pair (h,K). This 
pair cannot be freely specified but has to obey the 


four equations (144a) and (144b I which do not 


contain any time derivatives of h or K. Equa¬ 


tions (144a I and (144b) are therefore referred to as 
constraints, more specifically ( 144a| ) as scalar con¬ 
straint (also Hamiltonian constraint) and (144b) as 
vector constraint (also diffeomorphism constraint). 


Note that | cos6*| = \/l/3 describes a double cone 
around the symmetry axis generated by n and ver¬ 
tex at the origin, whose opening angle just is right 
so as to contain all three axes of For eigenvalue- 
vectors inside this cone the bilinear form is nega¬ 
tive, outside this cone positive. Positive G{K, K) 
require sufficiently anisotropic Weingarten maps, 
or, in other words, sufficiently large deviations from 
being umbilical points. 


The horizontal-horizontal component of Ein¬ 
stein’s equations in the form immediately fol¬ 
lows from (140). In the ensuing formula we use 
(133) to explicitly solve for the horizontal Lie 
derivative of K with respect to d/cdt and also (135) 
to simplify the last two terms in (140). This results 


We derived these equations from the 3+1 split 
of a spacetime that we considered to be given. De¬ 
spite having expressed all equations in terms of hor¬ 
izontal quantities, there is still a relic of the ambi¬ 
ent space in our equations, namely the Lie deriva¬ 
tive with respect to djdct. We now erase this last 
relic by interpreting this Lie derivative as ordinary 
partial derivative of some t-dependent tensor held 
on a genuine 3-dimensional manifold E, which is 
not thought of as being embedded into a space- 
time. The horizontal projection of the space- 
time Lie derivative that appears on the right-hand 
sides of the evolution equations above then trans¬ 
lates to the ordinary intrinsic Lie derivative on E 
with respect to j3. This is how from now on we shall 
read the above equations. Spacetime does not yet 


23 




















exist. Rather, it has to be constructed from the 
evolution of the fields according to the equations, 
usually complemented by the equations that gov¬ 
ern the evolution of the matter fields. In these 
evolution equations a and /3 are freely specihable 
functions, the choice of which is subject to mathe¬ 
matical/computational convenience. Once a and /3 
are specified and h as a function of parameter-time 
has been determined, we can form the expression 


(64) for the spacetime metric and know that, by 


construction, it will satisfy Einstein’s equations. 

To sum up, the initial-value problem consists in 
the following steps: 

1. Choose a 3-manifold E. 

2. Choose a time-parameter dependent lapse 
function a and a time-parameter dependent 
shift vector-field /3. 

3. Find a Riemannian metric h G S and a 
symmetric covariant rank-2 tensor field K G 


rT 2 E that satisfy equations (144a) and (144b) 


either in vacuum (T = Ta; cf. 13 ) , or after 
specifying some matter model. 


4. Evolve these data via (149) and (148), possibly 


complemented by the evolution equations for 
the matter variables. 

5. Construct from the solution the spacetime 


metric g via (64). 


For this to be consistent we need to check that the 


evolution according to (149) and (148) will pre¬ 
serve the constraints (144a) and (144b). At this 


stage this could be checked directly, at least in 
the vacuum case. The easiest way to do this is 
to use the equivalence of these equations with Ein¬ 
stein’s equations and then employ the twice con¬ 


tracted 2nd Bianchi identity (13). It follows that 
= 0, where -k The 


four constraints (144a) and (144b) are equivalent 
to = 0 and = 0 , and the six second-order 
equations E™" = 0 to the twelve first-order evolu¬ 


tion equations (149) and (148). In coordinates the 


identity = 0 reads 




_ ^ (150) 


which shows immediately that the time derivatives 
of the constraint functions are zero if the con¬ 
straints vanished initially. This suffices for ana¬ 
lytic data, but in the general case one has to do 
more work. Fortunately the equations for the evo¬ 
lution of the constraint functions can be put into 
an equivalent form which is manifestly symmetric 
hyperbolic [62]. That suffices to conclude the 
preservation of the constraints in general. In fact, 
symmetric hyperbolicity implies more than that. It 
ensures the well-posedness of the initial-value prob¬ 
lem for the constraints, which not only says that 
they stay zero if they are zero initially, but also 
that they stay small if they are small initially. This 
is of paramount importance in numerical evolution 
schemes, in which small initial violations of the con¬ 
straints must be allowed for and hence the conse¬ 
quences of these violations need to be controlled. 
For a recent and mathematically more thorough 
discussion of the Cauchy problem we refer to James 
Isenberg’s survey |5S] . 

Finally we wish to substantiate our earlier claim 
that any E can carry some initial data. Let us show 
this for closed E. To this end we choose a matter 


model such that the right-hand side of (144b) van¬ 


ishes. Note that this still allows for arbitrary cos¬ 
mological constants since TA(n, e^) oc (/(n, Ca) = 0. 
Next we restrict to those pairs (h, K) were K = Xh 
for some constant A. Geometrically this means 
that, in the spacetime to be developed, the Cauchy 
surface will be totally umbilical (isotropic Wein- 
garten map). Due to this proportionality and the 
previous assumption the vector constraint (144b) 
will be satisfied. In the scalar constraint we have 
G{K, K) = G{Xh, Xh) = — 6 A^ so that it will be 
satished provided that 


— eScal^ = 2«:T(n, n) — 6 A^ 


(151) 


For the following argument the Lorentzian signa¬ 
ture, e = —1, will matter. For physical rea¬ 
sons we assume the weak energy condition so that 
KT(n,n) > 0, which makes a positive contribution 
to the right-hand side of (151). However, if we 


choose the modulus of A sufficiently large we can 
make the right-hand side negative somewhere (or 
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everywhere, since E is compact). Now, in dimen¬ 
sions 3 or higher the following theorem of Kazdan 
& Warner holds ([9T], Theorem 1.1): Any smooth 
function on a compact manifold which is negative 
somewhere is the scalar curvature for some smooth 
Riemannian metric. Hence a smooth h exists which 
solves (151) for any given T(n, n) > 0, provided we 
choose > |A| sufficiently large. If E is not closed 
a corresponding theorem may also be shown [118j . 

The above argument crucially depends on the 
signs. There is no corresponding statement for pos¬ 
itive scalar curvatures. In fact, there is a strong 
topological obstruction against Riemannian met¬ 
rics of strictly positive scalar curvature. It follows 
from the theorem of Gromov & Lawson ( |78j , The¬ 
orem 8.1) that a 3-dimensional closed orientable E 
allows for Riemannian metrics with positive scalar 
curvature iff its prime decomposition consists of 
prime-manifolds with finite fundamental group or 
“handles” x 5^. All manifolds whose prime list 
contains at least one so-called K{tt, l)-factor (a 3- 
manifold whose only non-trivial homotopy group is 
the first) are excluded. See, e.g., [67] for more ex¬ 
planation of these notions. We conclude that the 
given argument crucially depends on £ = — 1. 


6.1 A note on slicing conditions 

The freedom in choosing the lapse and shift func¬ 
tions can be of much importance, theoretically and 
in numerical evolution schemes. This is particu¬ 
larly true for the lapse function a, which deter¬ 
mines the amount of proper length by which the 
Cauchy slice advances in normal direction per unit 
parameter interval. If a singularity is to form in 
spacetime due to the collapse of matter within a 
bounded spatial region, it would clearly be advan¬ 
tageous to not let the slices run into the singularity 
before the outer parts of it have had any chance 
to develop a sufficiently large portion of spacetime 
that one might be interested in, e.g. for the study 
of gravitational waves produced in the past. This 
means that one would like to slow down a in regions 
which are likely to develop a singularity and speed 
up a in those regions where it seems affordable. 
Take as an example the “equal-speed” gauge a = 1 


and /3 = 0, so that g = —c^ df^ + h. This means 
that n = d/dct is geodesic. Taking such a gauge 
from the t = 0 slice in the Schwarzschild/Kruskal 
spacetime would let the slices run into the singu¬ 
larity after a proper-time of t = ttGM/c^, where 
M is the mass of the black hole. In that short pe¬ 
riod of time the slices had no chance to explore a 
significant portion of spacetime outside the black 
hole. 

A gauge condition that one may anticipate to 
have singularity-avoiding character is that where a 
is chosen such that the divergence of the normal 
field n is zero. This condition just means that the 
locally co-moving infinitesimal volume elements do 
not change volume, for L^dfi = (V • n) d/i, where 
d/i = det{da&}d^a; is the volume element of E. 
From ( |7^ we see that n has zero divergence iff 
K has zero trace, i.e. the slices are of zero mean- 
curvature. The condition on a for this to be pre¬ 
served under evolution follows from 


0 = L^{h‘^‘>Kab) = -K‘^‘’L^hab + h’^'^L^Kab . 

(152) 


Here we use (801 to eliminate Lnhab in the first 


term and (131) to eliminate LnKab in the second 


term, also making use of (1351. This leads to the 
following equivalent of ( 152[): 


/\ha + £{mc{n,n) + K‘^'^Kab)a = {). (153) 

This is a linear elliptic equation for a. The case 
of interest to us in GR is £ = — 1. In the closed 
case we immediately deduce by standard argu¬ 
ments that q; = 0 is the only solution, provided 
the strong energy-condition holds (which implies 
Ric(n,n) >0). In the open case, where we might 
impose a —)■ 1 as asymptotic condition, we de¬ 
duce existence and uniqueness again under the as¬ 
sumption of the strong energy condition. Hence we 
may indeed impose the condition h°‘^Kab = 0, or 
Tr(Wein) = 0, for non-closed E. It is called the 
maximal slicing condition or York gauge [m]. 

Whereas this gauge condition has indeed the de¬ 
sired singularity-avoiding character it is also not 
easy to implement due to the fact that at each new 
stage of the evolution one has to solve the elliptic 


equation (153). For numerical studies it is easier 
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to implement evolution equations for a. Such an 
equation is, e.g., obtained by asking the time func¬ 
tion (36) to be harmonic, in the sense that 


0 = Ogt := 


(154) 


= |det{5a/3}| 

This is clearly just equivalent to 

= 0, (155) 


which can be rewritten using (651 and (661 to give 
da 


a : = —— = Lga — eK^a 
cdt ^ 


= Lpa + Tr(Wein 


a „,2 

2 


(156) 


a 


This is called the harmonic slicing condition. Note 
that we can still choose /3 = 0 and try to determine 
a as function of the trace of Wein. There also 
exist generalizations to this condition where c? on 
the right-hand side is replaced with other functions 

/(a)- 


6.2 A note on the DeWitt metric 


At each point p on S the DeWitt metric (118) can 


be regarded as a symmetric bilinear form on the 
space of positive-definite inner products h of TpS. 
The latter is an open convex cone in TpS 
We wish to explore its properties a little further. 




A frame in TpE 


induces a frame in Tp S i 


Tp*E 


(tensor product of the dual frame). If hah are the 
components of h then we have the following repre¬ 
sentation of the generalized De Witt metric 


Gix)=Gf^fdhab(^dhad, (157a) 

where 

Gf^f = i -t - 2A . (157b) 


Here we introduced a factor A in order to 
parametrize the impact of the negative trace term. 
We also consider E to be of general dimension n. 


The inverse metric to (157) is given by 


= G 


— 1 


d 


d 


(A) W<^bcd 


cd 


(158a) 


where 

^(A) abed ~ 2 “b ^ad^hc 2/i liafc/icrf) ■ 

(158b) 

The relation between A and g, is 

\ + g = nXg , (159) 


so that 


Gtlr^W an.cd = i i^cS’^d + m • ( 160 ) 

In ordinary GR n = 3, A = 1, and g = 1/2. 
Note that there are good reasons the keep the su¬ 
perscript — 1 even in component notation, that is, 
to write rather than just G,^x)abcd, since 

G(\)abcd KkhbihcmhdnG'l^^ unless 

A = 2/n, in which case X = g. 

If we change coordinates according to 


T : = In(^[det{/lab}] ") > 

1 

ab • — /^ab/[det}/lab}] , 


(161) 


where r parametrizes conformal changes and Tab 
the conformally invariant ones, the metric (157) 
reads 


G(a) =n(l — Xn)dT®dT+r°'‘^r^'^drab®dr(,d, (162) 

where r°‘'^rnb = S^. Since h is positive definite, so is 
r. Hence the second part is positive definite on the 
(ln(n-|-1) — l)- dimensional vector space of trace- 
free symmetric tensors. Hence the DeWitt metric 
is positive definite for A < 1/n, Lorentzian for A > 
1/n, and simply degenerate (one-dimensional null 
space) for the critical value A = 1/n. In the GR 
case we have A = 1 and n = 3, so that the DeWitt 
metric is Lorentzian of signature (—, -I-, -f, -|-, -I-, -I-). 
Note that this Lorentzian signature is independent 
of e, i.e. it has nothing to do with the Lorentzian 
signature of the spacetime metric. 
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In the Hamiltonian formulation it is not G but 
rather a conformally related metric that is impor¬ 
tant, the conformal factor being detj/iaf,}. If we 
set 

G^^y.= [det{hab}]"^^ (163) 

and correspondingly 

:= [det{h,b}]-'/"G(-;), (164) 


we can again write Gy) in terms of (r, Tab)- In fact, 
the conformal rescaling clearly just corresponds to 
multiplying (1621 with ^/det{hab} = Set¬ 

ting 

r:=4[(l-uA)/u]^^^e’"^/^ (165) 

we get, excluding the degenerate case A = 1/n, 


Gy) = sign(l — nX) dT ® dT 
+ T^G drab 0 dvcd , 


(166) 


where G = n/(16|l — nA|) (= 3/32 in GR). This 
is a simple warped product metric of K+ with 
the left-invariant metric on the homogeneous space 
GL(3, IR)/5'0(3) X M+ of symmetric positive defi¬ 
nite forms modulo overall scale, the warping func¬ 
tion being just if T is the coordinate on K+. 
Now, generally, quadratic warped-product metrics 
of the form zLdT 0 dT + T'^g, where g is indepen¬ 
dent of T, are non-singular for T 0 iff g is a 
metric of constant curvature ±1 (like for a unit 
sphere in M", with T being the radius coordinate, 
or the unit spacelike hyperboloid in n-dimensional 
Minkowski space, respectively). This is not the 
case for (166), which therefore has a curvature sin¬ 
gularity for small T, i.e. small det{/iab}. Note that 
this is a singularity in the space of metrics (here 
at a fixed space point), which has nothing to do 
with spacetime singularities. In the early days of 
Canonical Quantum Gravity this has led to specu¬ 
lations concerning “natural” boundary conditions 
for the wave function, whose domain is the space of 
metrics [S2]. The intention was to pose conditions 
such that the wave function should stay away from 
such singular regions in the space of metrics; see 
also |92| for a more recent discussion. 


We stress once more that the signature of the 
DeWitt metric is not related to the signature of 
spacetime, i.e. independent of e. For example, for 
the GR values A = 1 and n = 3, it is Lorentzian 
even if spacetime were given a Riemannian met¬ 
ric. Moreover, by integrating over E, the pointwise 
metric (166) defines a bilinear form on the infinite 
dimensional space of Riemannian structures on E, 
the geometry of which may be investigated to some 
limited extent [73 in]- 


7 Constrained Hamiltonian 
systems 

In this section we wish to display some charac¬ 
teristic features of Hamiltonian dynamical systems 
with constraints. We restrict attention to finite- 
dimensional systems in order to not overload the 
discussion with analytical subtleties. 

Let Q be the n-dimensional configuration man¬ 
ifold of a dynamical system that we locally co- 
ordinatize by (g^, • • • , q"). By TQ we denote 
its tangent bundle, which we coordinatize by 
(g^, • • • , g" , • • • , u”), so that a tangent vector 

X & TQ \s given hy X = v°^d/dq°‘. The dynamics 
of the system is described by a Lagrangian 

L-.TQ-^R, (167) 


which selects the dynamically possible trajectories 
in TQ as follows: Let M 9 t i—>■ x{t) G Q he a, (at 
least twice continuously differentiable) curve, then 
it is dynamically possible iff the following Euler 
Lagrange equations hold (we set dx/dt =: x): 


dL 


d 

dL 


dq°- 

q^x{t) 
v = ±{t) 

dt 

dv°- 

q^x{t) 
v = x(t)J 


Performing the t-differentiation on the 
term, this is equivalent to 

Hab{x{t),x(t))x^ = Va{x{t),x(t)) , 


(168) 


second 


(169) 


where 


Hab{q,v) 


d^L{q,v) 
dv<^dv^ ’ 


(170) 
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and 


are s functions </)„, a = 1, • • • , s such that 


Va(g,v) := 


dL(q,v) d^L(q,v) 


dq°- 


dv°-dq^ 


(171) 


Here we regard H and V as function on TQ with 
values in the symmetric n x n matrices and M" 


respectively. In order to be able to solve (169) for 


the second derivative x the matrix H has to be 
invertible, that is, it must have rank n. That is 
the case usually encountered in mechanics. On the 
other hand, constrained systems are those where 
the rank of H is not maximal. This is the case we 
are interested in. 

We assume H to be of constant rank r < n. 
Then, for each point on TQ, there exist s = {n — 
r) linearly independent kernel elements 7f(a)(9,u), 
a = I,-- - ,s, such that K?Jq,v)Hab{q,v) = 0. 
Hence any solution x{t) to ( |169 ) must be such that 
the curve t !->■ {x{t), x{t)) in TQ stays on the subset 


C := {{q,v) G TQ : tpa{q,v) =0, a = 1, • • • ,s}, 

(172a) 

where 

'ipaiq,v) = K^^-j{q,v)Vaiq,v). (172b) 

We assume C C TQ to be a smooth closed subman¬ 
ifold of co-dimension s, i.e. of dimension 2n — s = 
n + r. 

Now we consider the cotangent bundle T*Q over 
Q. On T*Q we will use so-called canonical coor¬ 
dinates, denoted by - ,q^,Pi,--- ,Pn\, the 

precise definition of which we will give below. The 
Lagrangian defines a map FL : TQ —>■ T*Q, which 
in these coordinates reads 

FL(g,?;) = (^,p := —• (1^3) 

From what has been said above it follows that the 
Jacobian of that map has constant rank n -I- r. 
Given sufficient regularity, we may further assume 
that 

C* := FL(C) C T*Q (174) 

is a smoothly embedded closed submanifold in 
phase space T*Q of co-dimension s. Hence there 


C* ■■= {(d,p) G T*Q : (j)aiq,p) =0, a = I,- - • ,s}. 

(175) 

This is called the constraint surface in phase space. 
It is given as the intersection of the zero-level sets of 
s independent functions. Independence means that 
at each p G C* the s one-forms d4>i\p • • • , d(j)s\p are 
linearly independent elements of T*T*Q. 

The dynamical trajectories of our system will 
stay entirely on C*. The trajectories themselves are 
integral lines of a Hamiltonian flow. But what is 
the Hamiltonian function that generates this flow? 
To explain this we first recall the definition of the 
energy function for the Lagrangian L. It is a func¬ 
tion E : TQ —>■ K defined through 

E{q,v) := - L{q,v). (176) 

At first sight this function cannot be defined on 
phase space, for we cannot invert FL to express v 
as function of q and p which we could insert into 
E{q,v) in order to get E{q,v{q,p)). However, one 
may prove the following: There exists a function 


He- :C* 

(177a) 

E = He- o FL . 

(177b) 


A local version of this is seen directly from tak¬ 
ing the differential of which yields dE = 

v°‘d{dL/dv°‘) — {dL/dq°‘)dq°‘, expressing the fact 
that dE(^g^y-j{X) = 0 if FL*(q_„)(A) = 0 for A G 
T(q,v)TQi or in simple terms: E does not vary if q 
and p do not vary. 

So far the function Tic* is only defined on C*. By 
our regularity assumptions there exists a smooth 
extension of it to T*Q, that is a function E[q : 
T*Q —)■ K such that Ho\c- = He*. This is clearly 
not unique. But we can state the following: Let Hq 
and H both be smooth (at least continuously dif¬ 
ferentiable) extensions of E[c* to T*Q, then there 
exist s smooth functions A“ : T*Q —>■ K such that 

H = Ho + . (178) 
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Locally a proof is simple: Let f : T*Q ^ M. he 
continuously differentiable and such that /|c» = 
0. Consider a point p £ C* and coordinates 
• • • y®) in a neighborhood U C 
T*Q of p, where the x’s are coordinates on the con¬ 
straint surface and the y’s are just the functions (j). 
In U the constraint surface is clearly just given by 
= • • • = y® = 0. Then 

fi d 

f\u{x,y)=J^ dt—f{x,ty) 

= I dt-^{x,ty) y“ = Xa{x,y)y°‘, 

(179a) 

where 

Xa{x,y) := [ dt^{x,ty). (179b) 

Jo dy°‘ 

For a global discussion see [5U] . 

As Hamiltonian for our constraint system we 
address any smooth (at least continuously differen¬ 
tiable) extension H of He* ■ So if Hq is a somehow 
given one, any other can be written as 


H = Hn 




(180) 


for some (at least continuously differentiable) real¬ 
valued functions A“ on T*Q. 

Here we have been implicitly assuming that the 
Hamiltonian dynamics does not leave the con¬ 


straint surface (174). If this were not the case 


we would have to restrict further to proper sub¬ 
manifolds of C* such that the Hamiltonian vector 
fields evaluated on them lie tangentially. (If no 
such submanifold can be found the theory is sim¬ 
ply empty). This is sometimes expressed by saying 
that the primary constraints (those encountered 
first in the Lagrangian/Hamiltonian analysis) are 
completed by secondary, tertiary, etc. constraints 
for consistency. 

Here we assume that our system is already dy¬ 
namically consistent. This entails that the Hamil¬ 
tonian vector-fields for the (f>a are tangential 
to the constraint surface. This is equivalent to 
X,j)^{(j)l 3 )\c* = 0, or expressed in Poisson brackets: 


{'pan — 0 , 


(181) 


for all a,l3 £ {I,-’’ )S}- Following Dirac [55] , 
constraints which satisfy this condition are said to 
be of first class. By the result shown (locally) 
above in ( [Tt^ this is equivalent to the existence 
of |s^(s — 1) (at least continuously differentiable) 
real-valued functions on T*Q, such 

that 

= (182) 


Note that as far as the intrinsic geometric proper¬ 
ties of the constraint surface are concerned (181) 
and (182) are equivalent. 

The indeterminacy of the Hamiltonian due to the 
freedom to choose any set of A“ seems to imply 
an s-dimension worth of indeterminacy in the dy¬ 
namically allowed motions. But the difference in 
these motions is that generated by the constraint 
functions on the constraint surface. In order to ac¬ 
tually tell apart two such motions requires observ¬ 
ables (phase-space functions) whose Poisson brack¬ 
ets with the constraints do not vanish on the con¬ 
straint surface. The general attitude is to assume 
that this is not possible, i.e. to assume that phys¬ 
ical observables correspond exclusively to phase- 
space functions whose Poisson bracket with all con¬ 
straints vanish on the constraint surface. This is 
expressed by saying that all motions generated by 
the constraints are gauge transformations. This en¬ 
tails that they are undetectable in principle and 
merely correspond to a mathematical redundancy 
in the description rather than to any physical de¬ 
grees of freedom. It is therefore more correct to 
speak of gauge redundancies rather than of gauge 
symmetries, as it is sometimes done, for the word 
“symmetry” is usually used for a physically mean¬ 
ingful operation that does change the object to 
which it is applied in at least some aspects (other¬ 
wise the operation is the identity). Only some “rel¬ 
evant” aspects, in the context of which one speaks 
of symmetry, are not changed. 


7.1 Geometric theory 

Being first class has an interpretation in terms of 
symplectic geometry. To see this, we first recall a 
few facts and notation from elementary symplec- 
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tic geometry of cotangent bundles. Here some sign 
conventions enter and the reader is advised to com¬ 
pare carefully with other texts. 

A symplectic structure on a manifold is a non¬ 
degenerate closed two-form. Such structures al¬ 
ways exist in a natural way on cotangent bun¬ 
dles, where they even derive from a symplectic po¬ 
tential. The latter is a one-form held 9 on T*Q 
whose general geometric dehnition is as follows: 
Let TT : T*(5 —t <3 be the natural projection from 
the co-tangent bundle of Q (phase space) to Q it¬ 
self. Then, for each p G T*Q, we dehne 


Op-po-K^p. (183) 

So in order to apply 9p to a vector X G TpT*Q, 
we do the following: Take the differential tt* of 
the projection map tt, evaluate it at point p and 
apply it to X G TpT*Q in order to push it forward 
to the tangent space Tt^(p)Q at point 7r(p) G Q. 
Then apply p to it, which makes sense since p is, 
by dehnition, an element of the co-tangent space 
at 7r(p) G Q. 

The symplectic structure, uj, is now given by 

w = —d9. (184) 


The minus sign on the right-hand side has no signif¬ 
icance other than to comply with standard conven¬ 
tions. Let us stress that 9, and hence w, is globally 
dehned. This is obvious from the global dehnition 
(1831. Therefore oj is not only closed, duj = 0, but 
even globally exact for any Q. Non-degeneracy of w 
will be immediate from the expression in canonical 
coordinates to be discussed below (cf. (196b)). 

A diffeomorphism F : T*Q ^ T*Q is called a 
canonical transformation or symplectic morphism 
if it preserves w, that is, if F*uj = uj. We explicitly 
mention two kinds of canonical transformations, 
which in some sense are complementary to each 
other. 

The hrst set of canonical transformations are 
hbre-preserving ones. This means that, for each 
q G Q, points in the hbre are moved to 

points in the same hbre 'K~^(q). This is equivalent 
to the simple equation 


TT O F = TT . ( 185 ) 


The special hbre-preserving diffeomorphisms we 
wish to mention are given by adding to each mo¬ 
mentum p G T*Q the value a{Tr{p)') of a section 
a:Q^T*Q: 

F{p) =p + cr^(p) . (186) 

This transforms the symplectic potential at p G 
T*Q into 


{F*9)p = 9Fip)oF,p 

® F{p)o{ttoF)^^ 

®A(p)o^,p (187) 

| | 186 ( 

— Pp “h O TT^p 

= 9p+{TT*a)^. 

Hence 

F*9 = 9 Ftt* a, (188a) 

F*uj = uj-7T*da. (188b) 

This is a canonical transformation if ct is a closed 
covector held on Q. By Poincare’s Lemma such a 
a is locally exact, but this need not be the case 
globally. Obstructions to global exactness are the 
hrst DeRahm cohomology class which is 

just dehned to be the vector space of closed mod¬ 
ulo exact covector helds on Q. The dimension of 
this vector space equals the rank of the free part 
of the ordinary hrst homology group i7i(Q,Z) on 
Q with integer coefficients. This latter group is 
always abelian and isomorphic to the abelianiza- 
tion of the (generally non-abelian) hrst homotopy 
group tti{Q). Hence for non-simply connected Q 
the possibility of canonical transformations exist 
which change the symplectic potential by a closed 
yet non-exact covector held. 

The second set of canonical transformations that 
we wish to mention are natural extensions to T*Q 
of diffeomorphisms of Q. These extensions not only 
leave invariant the symplectic structure w but also 
the symplectic potential 9. To see this we note that 
any diffeomorphism f : Q ^ Q has a natural lift 
to T*Q. We recall that a lift of a diffeomorphism 
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/ of the base manifold Q is a diffeomorphism F : 
T*Q ^ T*Q such that 


TT O F = f O TT . (189) 

This is equivalent to saying that the following dia¬ 
gram of maps commutes (a tailed arrow indicates 
injectivity and a double-headed arrow surjectivity) 

T*Q> - - - ^T*Q 

TT TT (190) 

w V 

Q> - 

Here the map F is just the pull-back of the inverse 
f~^. Hence the image of p G T*Q is given by 

^’(P) =po/■/«,,))• (191) 

From that it follows that the symplectic potential 
is invariant under all lifts of diffeomorphisms on Q: 


{F*9)p = 9 f^p^oF,p 

^ F{p)o {no F)^^ 

EH} zpf \ f 

F{p) o o 7r*p 

„_i „ 

— f^f{7T(p)) ° ° '^*P 

= po{f-^of)^^^^^on,p 

= dp ■ 


(192) 


So far we deliberately avoided intoducing local 
coordinates in order to stress global existence of 
the quantities in question. We now introduce con¬ 
venient coordinates in which the symplectic poten¬ 
tial and structure take on the familiar form. These 
are called canonical coordinates^ which we already 
mentioned above and the definition of which we 
now give. Let {x, U) be a local chart on Q such 
that X : Q D U ^ K" is the chart map with com¬ 
ponent functions This chart induces a chart 
{z,V) on T*Q, where V = n~^{U) C T*Q and 
z : y —>■ K^". We follow general tradition and label 


the first set of n component functions by z“ = q°‘ 
(for a = 1, • • • ,n) and the second set by z"+“ = p^ 
(for a = 1, • • • , n). For the first set we define 


g“(A) := a;“(7r(A)) 
and for the second 




(193a) 


(193b) 


7I-(A) . 


for any X £V. Note that (193b) just says that A = 
Pa(A) dx“| 7 r(A)- In this way we get a “canonical” 
extension of any chart on Q with domain [/ to a 
chart on T*Q with domain V = n~^{U). From the 
definition it is clear that 


7’‘»A 


(— 

)= — 

\dq'^ 

dx°' 


and 


that 


and 


A 


dpa 






7I-(A) 

(194a) 

= 0. 

(194b) 

the definition (1831 

Pa{X) 

(195a) 

= 0. 

(195b) 


dpa 

Hence, in canonical coordinates, the symplectic po¬ 
tential and structure take on the form 


d\v = Padq°’, 

Lo\v = dq°‘ A dp a ■ 


(196a) 

(196b) 


Note again that (1961 is valid in any canoni¬ 
cal completion of a chart on Q. As advertised 


above, it is immediate from (196bI that uj\v is 
non-degenerate at any point p G V. Since non¬ 
degeneracy is a pointwise property and valid in any 
canonical chart, it follows that w is non-degenerate 
everywhere. In the sequel we shall drop the explicit 
mention of the chart domain V. 

The non-degeneracy of uj allows to uniquely asso¬ 
ciate a vector field Xf to any real-valued function 
/ on T*Q through 


iXfUJ = df. 


(197) 
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It is called the Hamiltonian vector field of /. An 


immediate consequence of (1971 and du = 0 is that 


uj has vanishing Lie derivative with respect to any 
Hamiltonian vector field: 

LxfOJ = {ixf o d + do ixf )w = 0 . (198) 

In coordinates Xf looks like this: 

df d df d 

^ dpadq- dq-dpa' ^ ’ 

The Poisson bracket between two functions / and 
g is defined as 


{/,g} : = uj{Xf,Xg) = Xgif) = -Xf{g) (200a) 
dpa dpa dq^- ' 


( 200 b) 


It provides C°°{T*Q) with a structure of a Lie al¬ 
gebra, which means that for all f,g,h€ C°°(T*Q) 
and all a G K we have, 

{f,9} = -{gj} , ( 201 a) 

{af + g,h} = a{f,h} + {g,h} , ( 201 b) 

{/, {g, h}} + {g, {h, /}} -t {h, {/, 5 }} = 0 . 

( 201 c) 

Antisymmetry and bi-linearity are obvious from 


(200 1 . The third property (201c), called the Jacobi 


identity, can of course be directly checked using 


the coordinate expression ( 200 b), but the geomet¬ 


ric proof is more instructive, which we therefore 
wish to present here. 

The first thing we note is that the map f ^ Xf 
obeys 

Xu,a} = -[Xf,X,]. 

This follows from 

d{f,g} = d{uj{Xf,Xg)) 

= dixgixf^ 

= Lxgixf^ — iXgdixfifJ 

= HXg,Xf]U} = -iiXf,Xg]UJ, 


( 202 ) 


(203) 


where in the last step we used dixjto = 0 due to 
duj = 0 and LxfUj = 0 and once more Lx^uj = 0 in 


the first term. As u! is non-degenerate comparison 


with (1971 leads to (202). Next we recall that the 


exterior differential of a general fc-form field a, ap¬ 
plied to the fc -I- 1 vectors Xq,Xi, - ■ ■ , A^, can be 
written as 


da{Xo, • • • , Xk) 

= Y. (-1)* A,(a(Ao,--- ,A„.. 

0<i<k 

+ Y cf{[x,,x,],xo, 

0<i<j<k 


,Xk)) 


(204) 


A,; 


,Xj,--- ,Xk). 


Here the hatted entries are omitted. This 
we apply to duj{Xf,Xg,Xh) and make use 
of, e.g., Xf{uj{Xg,Xh)) = {Lxftv){Xg,Xh) + 
u;i[Xf,Xg],Xh)+ujiXg, [Af, A,,]), as well as 
Then we get, writing J2(fgh) cyclig sum over 

f,g,h: 


du;iXf,Xg,X^) = - Y ^iXf,[Xg,X^]) 
ifgh) 

® H ^{Xf,X[g^h}) 

ifgh) 

•33 El/.{»,'•}). 

ifgh) 


(205) 


Hence we see that the Jacobi identity (201c) follows 
from duj = 0. 


Equations ( |201 ) state that C°°{T*Q) is a Lie 
algebra with {•, •} as Lie product. With re¬ 
spect to the ordinary pointwise product of func¬ 
tions C°°{T*Q) is already a commutative and as¬ 
sociative algebra. Both structures are linked via 


{f ■ 9 ,h} = f ■ {g,h} + {f, h}-g, 


(206) 


which immediately follows from the Leibniz rule 


and ( 200 ) (which implies that {f-g, h} is the deriva¬ 


tive oi f ■ g along Xh). All this is expressed by 
saying that C°°(T*Q) is a Poisson algebra, mean¬ 
ing the coexistence of two algebraic structures, one 
of a commutative and associative algebra, one of 


a Lie algebra, and their compatibility via (206). 


Note that the liner space TT{T*Q) of vector fields 
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is also a Lie algebra, whose Lie product is the 
commutator of vector fields. Equation (202) then 
shows that the map from C°°{T*Q) to TT{T*Q) 
that sends f Xf is an anti homomorphism of 
Lie algebras. 

After this brief digression we now return to the 
geometric interpretation of first-class constraints. 
For any p G C* we define 


T^{T*Q) := 

{X G Tp{T*Q) :lu{X,Y) =0,yY G TpC*} . 

(207) 


The significance of this lies in the following re¬ 
sult, which we state in an entirely intrinsic geo¬ 
metric fashion. Let C* C T*Q be co-isotropic of 
co-dimension s and let e : C* —>■ T* Q he its embed¬ 
ding. We write 

w := e*a; ( 210 ) 

for the pull back of to to the constraint surface 
(i.e. essentially the restriction of w to the tan¬ 
gent bundle of the constraint surface), uj is now 
s-fold degenerate, its kernel at p G C* being just 
Tp{T*Q) C TpC*. We have the smooth assign¬ 
ment of subspaces 


The non-degeneracy of w implies that the dimen¬ 
sion of Tp {T*Q) equals s, the co-dimension of C* in 
T*Q. But note that as w is skew, T^{T*Q) might 
well have a non-trivial intersection with TpC*. This 
gives rise to the following characterizations for the 
submanifold C* C T*Q (understood to hold at each 
point p G C*): C* is called 

• isotropic iff TpC* C T:^{T*Q)-, 

• co-isotropic iff TpC* D Tjj-{T*Q); 


• Lagrangian iff TpC* = T^{T*Q). 

Since = d4>a{X^^) we see that condi¬ 

tion (181) is equivalent to the statement that the 
Hamiltonian vector-fields for the constraint func¬ 
tions (fia are tangent to the constraint hypersurface: 


X^Jc^gTTC*. (208) 

Our assumption that the s differentials d(j)a be 
linearly independent at each p G C* now implies 
that the s vectors X^^ {p) span an s-dimensional 
subspace of TpC*. But they are also elements of 
T^{T*Q) since lo{X^^,Y) = d^a{Y) = 0 for all Y 
tangent to C*. As the dimension of T^(T*Q) is s, 
this shows 


T^{T*Q) = spanjX,^, ■■■ ,X^^}c TpC* , (209) 

that is, co-isotropy of C*. First-class constraints 
are precisely those which give rise to co-isotropic 
constraint surfaces. 


C* 3 p^ kernelp(cb) = T^{T*Q) , (211) 

which forms a sub-bundle of TC* called the kernel 
distribution of a) . Now, the crucial result is that 
this sub-bundle is integrable, i.e tangent to locally 
embedded submanifolds 7* C C* of co-dimension s 
in C*, or co-dimension 2s in T*Q. Indeed, in order 
to show this we only need to show that whenever 
two vector fields X and Y on C* take values in the 
kernel distribution their commutator [X, Y] also 
takes values in the kernel distribution. That this 
suffices for local integrability is known as Frobenius ’ 
theorem in differential geometry. Writing 

= Lx{iYiy) — iY{LxCj) (212) 

we infer that the first term on the right-hand side 
vanishes because Y is in w’s kernel and Lxdi van¬ 
ishes because Lx = doix +ix ° d on forms, where 
ixCj = 0 again due to X being in the kernel and 
dCj = de*uj = e*duj = 0 due to w being closed. 

The program of symplectic reduction is now 
to form the (2n — 2s)-dimensional quotient space 
C*! where ~ is the equivalence relation whose 
equivalence classes are the maximal integral sub¬ 
manifolds of the kernel distribution of w. C*is 
called the physical phase-space or reduced space of 
states. 

We stress that this geometric formulation of 
the reduction program, and the characterization of 
C* in particular, does not refer to any set of func¬ 
tions (fa that one might use in order to character¬ 
ize C*. If one uses such functions, it is understood 
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that they obey the above-mentioned regularity con¬ 
ditions of being at least continuously differentiable 
in a neighborhood of C* and giving rise to a set of s 
linearly independent differentials d(j)a at any point 
of C*. Hence redefinitions of constraint functions 
like (j) !->■ \/\^\ or (/) !-)• albeit leading to the 
same surface C*, are a priori not allowed. 

The reduced phase space can be identified with 
the set of physical states. Smooth functions on this 
space then correspond to physical observables. But 
how can we characterize the latter without explic¬ 
itly constructing C*/ This we shall explain in 
the remaining part of this section. 

We define the gauge algebra as the set of smooth 
functions on unreduced phase space that vanish on 
the constraint surface: 


Gau:={/eC~(r*g):/|c. =0}. (213) 


This set is clearly an associative ideal with respect 
to the pointwise product. But it is also a Lie sub¬ 
algebra with respect to the Poisson bracket. To 
see this, we first remark that Xflc* is T^{T*Q)- 
valued if / S Gau. Indeed, / G Gau implies that 
kerne l(dfp) includes TpC* for p £ C*. But then 


(1971 shows Xf{p) G Tp{T*Q). Now, this im- 


mediately implies that Gau is a Lie algebra for 
co-isotropic C*, for then ( 200 [ ) implies {/,g}|c* = 
uj{Xf,Xg)\c* = 0. Hence Gau is a Poisson sub¬ 
algebra of C°°(T*Q) and also an associative ideal 
with respect to pointwise multiplication. However, 
it is not a Lie-ideal with respect to {•,•}. Indeed, 
if / G Gau we just need to take a g G C°°(T*Q) 
which is not constant along the flow of Xf\c*', 
then {f,g}\c* ^ 0. This means that we can¬ 
not define physical observables by the quotient 
C°° {T* Q)/Gau, since this will not result in a Pois¬ 
son algebra. As we insist that all elements in Gau 
generate gauge transformations, we have no choice 
but to reduce the size of C°°{T*Q) in order to 
render the quotient a Poisson algebra. Econom¬ 
ically the most effective possibility is to take the 
Lie-idealizer of Gau in C°°{T*Q), which is defined 
as follows 


/Gau := {/ G C°°{T*Q) : {f,g}\^, = OV 5 G Gau}. 

(214) 


Note that /cau is the set of smooth functions 
that, to use a terminology introduced by Dirac [KH], 
weakly (Poisson) commute with the constraints (i.e. 
with Gau). Here weak (Poisson) commutativity 
means that the Poisson brackets of observables and 
constraints need not vanish globally, i.e. on T*Q, 
but only after restriction to C*. We will briefly 
come back to the case of strong (Poisson) commu¬ 
tativity below. 

Now, if Icau/Gau is to make sense as Poisson al¬ 
gebra of physical observables Jcau must be a Pois¬ 
son algebra containing Gau as an Poisson ideal. 

That /cau is an associative algebra under point- 
wise multiplication immediately follows from (206). 
That it is also a Lie subalgebra follows from the 
Jacobi identity (201cI. Indeed, let f,g £ /cau and 
h £ Gau; then (201c) immediately gives 


= -{M,/}|c.-{{^.5}|c. =0- (215) 


Hence we have shown that /cau is a Poisson sub¬ 
algebra of C°°{T*M) which contains Gau as a Pois¬ 
son ideal. By construction /cau is the largest 
Poisson subalgebra of C°°{T*M) with that prop¬ 
erty. Hence we may identify the Poisson algebra 
of physical observables, or reduced space of observ¬ 
ables with the quotient 


Ophys Jcau/Gau. (216) 


This complements the definition of the reduced 
space of physical states. Again we stress that the 
definition given here does not refer to any set of 
functions (pa that one might use in order to char¬ 
acterize C*. 

We also stress that instead of the Lie idealizer 
(214) we could not have taken the Lie centralizer 


Gcau := {/ G G~(T*Q) : {f,g} = 0 V 5 G Gau}. 

(217) 

Note that here the only difference to ( |214| ) is that 
{f,g} is required to vanish strongly (i.e. on all of 
T*Q) and not only weakly (i.e. merely on C*). This 
makes a big difference and the quotient Icau/Gau 
will now generally be far too small. In fact, it is 


34 













intuitively clear and also easy to prove (see, e.g., 
Lemma 5 of m) that the Hamiltonian vector fields 
Xf corresponding to functions / G Gau, which 
span T^{T*C) C TpC* at each p £ C, span all 
of Tp(T*Q) for each point p off C*. This implies 
that smooth functions in C°° {T* Q) which strongly 
(Poisson) commute with Gau are locally constant 
outside C*. Sometimes strong (Poisson) commuta¬ 
tivity is required not with respect to Gau but with 
respect to a complete set {^i, • • • , (/)s} of functions 
in Gau defining C*; for example the component 
functions of the momentum map (see next subsec¬ 
tion). But even then strong commutativity is too 
strong, as a smooth function commuting with all 
(pa on C* need generally not extend to a smooth 
function defined in a neighborhood of C* in T*Q 
which still commutes with all (pa.. The reason is 
that the leaves of the foliation defined by the pa 
may become ‘wild’ off C*. Compare, e.g., the dis¬ 
cussion in [3^ (including the example on p. 116). 


7.2 First-class constraints from zero 
momentnm-maps 

First class constraints often arise from group ac¬ 
tions (see Appendix for group actions). This is 
also true in GR, at least partially. So let us ex¬ 
plain this in more detail. Let a Lie group G act 
on the left on T*Q. This means that there is a 
map G X T*Q —?> T*Q, here denoted simply by 
i9,p) 9 -P, so that gi ■ {g 2 ■ p) = ( 3152 ) • P and 

e-p = p\ie&G\s the neutral element. As al¬ 


ready seen earlier in (24) and explained in detail in 


the Appendix, there is then an anti-homomorphism 
from Lie(G), the Lie algebra of G, to the Lie alge¬ 
bra of vector fields on T""Q. Recall that the vector 
field corresponding to X G Lie(G), evaluated 
at point p G T*Q, is given by 


V^(p) 


_d 

dt 


exp(tX) ■ p. 

t=o 


(218) 


Then 

=-H[xy]. (219) 

Let us further suppose that the group action on 
T*Q is of a special type, namely it arises from a 


group action on Q by a canonical lift. (Every diffeo- 
morphism f of Q can be lifted to a diffeomorphism 
F of T*Q given by the pull back of the inverse 
/“^.) Then it is easy to see from the geometric 


definition (183) that the symplectic potential 9 is 


invariant under this group action and consequently 
the group acts by symplectomorphisms (w preserv¬ 
ing diffeomorphisms). The infinitesimal version of 
this statement is that, for all X G Lie(G), 


LyxO = 0 . 


( 220 ) 


Since Lyx = iyx od + do iyx this is equivalent to 

iyxuj = d{9{V^)) (221) 

which says that is the Hamiltonian vector field 
of the function 9(V^). We call the map 

Lie(G) 9 X P(X) := 0(V^) G G“(r*Q) 

( 222 ) 

the momentum map for the action of G. It is a 
linear map from Lie(G) to G°°{T*Q) and satisfies 


{P(X),P(r)} = H^(0(H^)) 


= {Lyx9)(V^)+9{LyxV^) 
= 9{V^^’^^) 

= P{[X,Y]), 


(223) 


where we used (220) and (219) for the third equal¬ 


ity. Hence we see that the map (222) is a Lie ho¬ 


momorphism from Lie(G) into the Lie algebra of 
smooth, real-valued functions on T*Q (whose Lie 
product is the Poisson bracket). 

Now, first class constraints are often given by 
the condition of zero momentum mappings, i.e., by 
P(X) = 0 for all X G Lie(G). By linearity in 
X, this is equivalent to the set of s := dim(G) 
conditions 

Pa ■■= Piea) = 0 , (224) 

where = {ei, • • • ,6^} is a basis of Lie(G). Let 
the structure constants for this basis be GW, i.e. 


[caXp] = ^apOj, then (223) becomes 


{Pa,Pl3} = Clppj . 


(225) 
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Constraints in gauge theories will typically arise 
as zero momentum maps in the fashion described 
here, the only necessary generalization being the 
extension to infinite-dimensional groups and Lie al¬ 
gebras. In fact, for gauge theories our G will cor¬ 
respond to the infinite-dimensional group of gauge 
transformations, which is not to be confused with 
the finite-dimensional gauge group. The former 
consists of functions, or sections in bundles, with 
values in the latter. On the other hand, the con¬ 
straints in GR will only partially be of this type. 
More precisely, those constraints arising from 3- 
dimensional diffeomorphisms (called the vector or 
diffeomorphism constraints) will be of this type, 
those from non-tangential hypersurface deforma¬ 
tions (scalar or Hamiltonian constraint) will not fit 
into this picture. For the former G will correspond 
to Diff(E), or some appropriate subgroup thereof, 
and Lie(Diff (S)) to the infinite-dimensional Lie al¬ 
gebra of vector fields on E (possibly with spe¬ 
cial support and/or fall-off conditions). The differ¬ 
ent nature of the latter constraint will be signaled 
by structure functions G2,p{q,p) appearing on the 
right-hand side rather than constants. This has 
recently given rise to attempts to generalize the 
group-theoretic setting described above to that of 
groupoids and Lie algebroids, in which the more 
general structure of GR can be accommodated [331 • 


8 Hamiltonian GR 


The Hamiltonian formulation of GR proceed along 
the lines outlined in the previous section. For this 
we write down the action in a (3-|-l)-split form, 
read off the Lagrangian density, define the con¬ 
jugate momenta as derivatives of the latter with 
respect to the velocities, and finally express the 
energy function (1761 in terms of momenta. The 


constraint functions will not be determined on the 
Lagrangian level, but rather directly on the Hamil¬ 
tonian level as primary and secondary constraints 
(there will be no tertiary ones), the primary ones 
being just the vanishing of the momenta for lapse 
and shift. 

The Lagrangian density for GR is essentially just 


the scalar curvature of spacetime. However, upon 
variation of this quantity, which contains second 
derivatives in the metric, we will pick up bound¬ 
ary terms from partial integrations which need not 
vanish by just keeping the metric on the boundary 
fixed. Hence we will need to subtract these bound¬ 
ary terms which will otherwise obstruct functional 
differentiability. Note that this is not just a mat¬ 
ter of aesthetics: Solutions to differential equations 
(like Einstein’s equation) will not be stationary 
points of the action if the latter is not differentiable 
at these points. Typically, Euler Lagrange equa¬ 
tions will allow for solutions outside the domain 
of differentiability of the action they are derived 
from. Including some such solutions will generally 
need to adapt the action by boundary terms. This 
clearly matters if one is interested in the values of 
the action, energies, etc. for these solutions and 
also, of course, in the path-integral formulations of 
the corresponding quantum theories. 

The Einstein-Hilbert action of GR is 

5'gr[H,5i] = — — / Seal dfj,g + boundary terms 
2k Jq 

(226) 

where in local coordinates = {x° = 

ct, x^), 

dpLg = edLet{g^,j} cdt A dx^ A dx"^ A dx^ . (227) 
The sign convention behind the prefactor —e in 


(226) is such that in the Lorentzian as well as the 


Riemannian case the Lagrangian density contains 
the bilinear De Witt inner product of the extrinsic 


curvatures (compare (141)) with a positive sign, i.e. 


transverse traceless modes have positive kinetic en¬ 
ergy. 


The boundary term can be read off (141) and 


(139). If the integration domain H C M is such 


that the spacelike boundaries are contained in two 
hypersurfaces Eg, i.e. two t = const, surfaces, say 
dLli := do. n Einitiai and dLlf := dVl n Egnai, we 
would have to add the two boundary terms (de¬ 
pendence on e drops out) 


[ TTh{K)dfih-K ^ f TihiK) dfj.h ■ 
JdQf JdQi 

(228) 
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Here we used that the second term in (1391 does 


not contribute due to a being orthogonal to n. duh 
is the standard measure from the induced metric, 
h, on the hypersurfaces. If the cylindrical timelike 
boundary Sflcyi is chosen such that its spacelike 
normal m is orthogonal to n, only the second term 
in ( |139 ) contributes and we get one more boundary 
term (again e drops out) 


' or 2 cyi 


K{n,n)dnh ■ 


(229) 


Here K is the extrinsic curvature of Sflcyi in 
M, which we picked up because g{a, m) = 
g{Vnn, to) = -g{n, Vnm) = K{n, n). 

Once the boundary terms are taken care of we 


can just read off the Lagrangian density from (1411 


also using (66), 


CGn = (2 k)-^[G{K,K) - eR]aVh, (230) 
where we now used the standard abbreviations 


GiK,K) 

R 

Vh 




ab^cd 5 


= Seal 


D 


(231) 


= yjdet{hab} ■ 


Moreover, Kab has here to be understood as ex¬ 
pressed in terms of the time and Lie derivatives of 

^ab- 

(h-Lph). (232) 


K = -^a- 


We keep in mind that an overdot denotes differen¬ 
tiation with respect to ct (not t). In passing we also 
note that £gr has the right physical dimension of 
an energy-density (a is dimensionless). 

The Hamiltonian density is now obtained by the 
usual Legendre transform with respect to all con¬ 
figuration variables that are varied in the action. 
These comprise all components and hence in 
the (3-|-l)-split parametrization all hab as well as 
the lapse a and the three shift components /3°‘. 
However, it is immediate that (230) does not con¬ 


tain any time derivatives of the latter; hence their 


conjugate momenta vanish: 

1 


= 0 , 


^Q! ■- ■ cv . 

c aa 
1 dCcK 

TTfla :=-^— = 0 . 

c a/ja 


(233a) 

(233b) 


This leaves us with the momenta for the metric 
components hab 


^o-b . 

TT : = — 


1 5£gr 

C dhab 

[-e)y/h GabcdR^^ (234) 
2kc 

( rbabed 


2kc 




Here again K stands for the expression (2321. We 


also made use of the conformally rescaled De Witt 


metric (163) whose significance appears here for 


the first time. Again in passing we note that the 
physical dimension of 7r“^ is right, namely that of 
momentum per area (the dimension of K is an in¬ 
verse length). 

In order to compute the Hamiltonian density we 
express h in terms of the momenta 


hab — {hjf5h)ab 2j£C^Kab 

= {DaPb + DbPa) + 4q:kc 

and obtain 

Uolh, tt] = 7r“'’ chab - Cqr 

1 ^ab^cd 


(235) 


= a 

+ e(2K)~^'/hR 
+ 2cn^^DaPb . 


(236) 


The Hamiltonian, Hq, is just the integral of this 
density over S. The subscript 0 is to indicate that 
this Hamiltonian is still to be modified by con¬ 
straints according to the general scheme. Also, we 
have to once more care about surface terms in order 
to ensure functional differentiability without which 
the Hamiltonian flow does not exist m- 
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The first thing to note is that we have found the 


in (147). In fact, it is small enough to just touch 


primary constraints (2331. For them to be main¬ 


tained under the evolution we need to impose 


I fjx n 

CTTq ={7ra,Ho} = —^=0, 

da 

CTT^a = = 0 , 

giving rise to the secondary constraints 
(2Kc^)G'“^(7r, tt) -I- e{2K)~^\/hR = 0 , 


(237a) 

(237b) 


boundaries of the positive and negative octants in 
This means that |cos0| > 1 / 2/3 implies that 
either all Pa are positive or all Pa are negative. In 
other words (/“^(tt, tt) < 0 implies that the sym¬ 
metric bilinear form tt is either positive or negative 
definite. In contrast, (147) did not allow to con- 


(238a) 
(238b) 

respectively. It may be checked directly that these 


- = 0 , 


elude definiteness of the symmetric bilinear form 
K from G{K,K) < 0, since the interiors of the 
double-cone |cos0| > y^ 1/3 intersect the comple¬ 
ments of the definite octants. 

Let us now return to the constraints. We have 


found the primary and secondary constraints (233) 


and (238) respectively. The most important thing 


equations respectively are equivalent to (144) for 
T = 0. If we had included a cosmological con¬ 
stant this would have led to the replacement of R 


to note next is that there will be no further (ter¬ 
tiary, etc.) constraints. Indeed, this follows from 
the general argument following (150), which en¬ 


in (238a) with (i? — 2A). 

In passing we make the following geometric ob¬ 
servation regarding the bilinear form G'“^(7r, tt), 
which is analogous to that made for G{K, K) below 
equation ( |145 ). We can choose a local frame {cq} 
so that hab = h{ea, Cb) = Sab and 7r“^ = 7r(0“, = 

diag(pi,p 2 jP 3 )) where {9°'} is dual to {ca}- Then 


sures the preservation of the secondary constraints 
under Hamiltonian evolution. The primary con¬ 
straints are now taken care of by simply eliminating 
the canonical pairs (a,7rQ,) and (/3“,7r^a) from the 
list of canonical variables. As we will see shortly. 


G /7r,7r) := (<5“'’ - \n°^v!’)paPb , 


(239) 


where, as before, n“ are the components of the 
normalized vector n := (l,l,l)/-/3 in eigenvalue- 


the secondary constraints (238) are of first-class, 
so that, according to the general theory outlined 
above, they should be added with arbitrary coef¬ 
ficients to the initial Hamiltonian Hq to get the 
general Hamiltonian. This leads to 

H[a,f3] = Cs(a)-|-C'„(/3)-|-boundary terms, (241) 


space, which we identify with endowed with the 

standard Euclidean inner product. Denoting again 

where 

r 




by 9 the angle between ft and p := {pi,P 2 ,P 3 ), we 

Csia) : = 

/ cPx a 

{2kc^)G ^(tt. 

tt) 


have 


Jy. 




CO 

II 

CO 

0 

0 

0 



-1- e(2/c) ^VhR 

, (242a) 

G ^(tt, tt) = < > 0 if cos 6* < a/2/3 

G„(/3) : = 

/ Sx /3“ 

-2chabDcTr^’' 

7 

(242b) 

[<0 if cos6* > •/2/3. 


Jy 





(240) 

This should be compared with (147). The differ¬ 
ence is that a/ 1/3 is replaced by \/2/3, which is 
due to A = 1 in ( |157b| ) but p = 1/2 in (158b) in 
the GR case. This has an interesting consequence: 
The condition | cos 9\ = y/2/3 now describes a dou¬ 
ble cone around the symmetry axis generated by n 
and vertex at the origin, whose opening angle is 
strictly smaller than that of the cone considered 


where a and are now arbitrary coefficients cor¬ 


responding to the A’s in (180). In particular, they 


may depend on the remaining canonical variables h 
and TT. Note that up to boundary terms the Hamil¬ 
tonian is just a sum of constraints, where S stands 
for the scalar- (or Hamiltonian-) and and V for the 
vector (or diffeomorphism-) constraint. 

The equations of motion generated by H will 
clearly be equivalent to (149) and (148). Let us 
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write them down explicitly. To do this we first 
note that the functional derivatives of Cv{/3) with 
respect to h and tt are easily obtained if we note 
that, modulo surface terms, the integrand can ei¬ 
ther be written as c(Lph) or as —c(L,g7r)“^/ia{,. 
Hence, given that the surface terms are so chosen 
to guarantee functional differentiability, we have 

=cLphab=C {DaPb + DbPa) (243) 


contravariant components of the Einstein tensor for 
h. Taken together we get 


SC,(a) 

Shab 


Kc^a 




2k - 


aG‘^\h) - 


V~h 


(248) 


and 


SC^fS) 

Shab 


= -c{LpTT) 


ah 


= - c [G,(/3%“^) - - (^c/?'')7r“] . 

(244) 

Note that in the first term on the right-hand side of 
(2441 the /3'^ appears under the differentiation Dc 
because tt is a tensor-density of weight one. The 
functional derivative with respect to tt of Gs(q;) is 
simply 4aKC^G(7r, •), so that the equation of mo¬ 
tion for h is readily written down: 


^hab — — 


= 4:aKc‘‘Gj^j^TT’'‘^ -f cL/shab ■ 


SH 

Stt°-^ 

cd 


(245) 


Using (234) this is immediately seen to be just 
( |149| ). From the explicit h-dependence of G, as 
displayed in (|158b ) and (163), we obtain, using 
dVh/dhab = lVhh°-'^, 


dG (tt, tt) li,ab/b—l/_ I iM,an^bm/^—l ^cd 
- ^ - = -2^ ^ (7r,7r)-t2/r tt . 

(246) 

For the variational derivative of the second term 
in (242) we use the following standard formula for 
the variation of Vh times the scalar curvature of h 


6{VhR{h)) 

= Vh{-G‘^\h)Shab + G^”^<^^D^Dm5Kb) , 


(247) 


which immediately follows from ( 117[ ), where we 
recall that gab there corresponds to hab here and 
also that = —h°‘^h^^5hcd- G°‘^{h) denote the 


With (244) this gives the second Hamilton equation 
SH 


• ah ur'l 

CTT =| 7 r \ = — 


ah 


Sha 


= nc a 
e 


/r“'’G-^(7r,7r) -4h“V™Gr^ 


cd 


2k . 

+ cLfl7r“^ 


aG“^(/i) - G^’^'^'^DaDraa 


Timed ‘ 


Vh 


(249) 


These are seen to be equivalent to (149) and 


(148) respectively. Before we discuss the bound¬ 
ary terms we write down the Poisson brackets for 
the constraints. 


{G„(/3),G„(/3')} =a([/3,/3']), (250a) 

{G„(/3),G,(a)} =G«(^(a)), (250b) 

{G,(a),Gs(Q;')} = £G„(a(da')“ - a'{da)^) . 

(250c) 

These may be obtained by direct computation, but 
are also dictated by geometry. Before discussing 
the geometry behind them, we note the following 
more or less obvious points: 

1. The vector constrains form a Lie algebra. The 
map /? —> U(/3) is a Lie homomorphism form 
the Lie algebra of vector fields in E to the Lie 
algebra (within the Poisson algebra) of phase- 
space functions. In fact, this map is just the 
momentum map for the action of the diffeo- 
morphism group G = Diff(S) on phase T*Q, 
which is a lift of the action on Q = Riem(S), 
the space of Riemannian metrics on E. Note 
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that here the symplectic potential can be writ¬ 
ten in a symbolic infinite-dimensional notation 
(cf. ( [I9^ ) 

0 = / X Tr‘^^{x) Shab{x) (251) 

JT, 

and the vector field Vg generated by the action 
of G = Diff(S) on Q = Riem(S): 

Vp = [ d^x Lphab{x )-^—. (252) 

7s otiab{x) 

The momentum map (222) is then given by 


Pf3 = S{Vp) = [ d^X TT^'^Lphab 

7s 

= C-lG„(/3)+2 / d^PaTT’^^Ub, 

Jan 


(253) 


where denote the components of the out¬ 
ward pointing normal of dT,. This shows that 
for vector fields /3 for which the surface term 
does not contribute the vector constraint is 
just the momentum map (up to a factor of 
c~^, which comes in because the physical di¬ 
mension of the values of the momentum map 
are that of momentum whereas the physical 
dimension of the constraints are that of an 
Hamiltonian, that is, energy). The surface 
term will be discussed below. What is im¬ 
portant here is that the vector constraint coin¬ 
cides to the zero momentum-map for those dif- 
feomorphisms which are asymptotically triv¬ 
ial, i.e. for which the surface term vanishes. 
Only those are to be considered as gauge trans¬ 
formations! Long ranging diffeomorphism for 
which the surface term is non zero, i.e. for 
configurations of non vanishing linear and/or 
angular momentum (cf. Section [^, have to 
be considered as proper changes in physical 
state. If we required these motions to be pure 
gauge we would eliminate all states with non¬ 
zero asymptotic charges. Compare the closing 
remarks of Section[7l 


2. Once we have understood that the vector con¬ 
straint is the momentum map for diffeomor- 
phisms, its Poisson bracket with any other 


phase-space function F that defines a geomet¬ 
ric object on S (i.e. an object with well de¬ 
fined transformation properties under diffeo- 
morphisms) is fixed. We simply have 


{F,Vfi}=L0F. (254) 


In this sense (250b) says no more than that 
the expression (238a) is a scalar density of 
weight one. Recall that if f is a scalar den¬ 
sity of weight one then L/jT" = i7a(/3“F). If 
we multiply F hy a and integrate over S we 
get after partial integration and assuming the 
boundary term to give no contribution (which 
for non closed S requires certain fall-off con¬ 
ditions) an integral of —F/3{a), which is just 
what (250b) expresses. Algebraically speak¬ 
ing, the fact that the Poisson bracket of a vec¬ 
tor and a scalar constraint is proportional to a 
scalar rather than a vector constraint means 
that the vector constraints do not form an 
ideal. Geometrically this means that the 
Hamiltonian vector fields for the scalar con¬ 
straint, if evaluated on the hypersurface for the 
vector constraint, will generally not be tan¬ 
gential to it, except for the points where this 
hypersurface intersects that of the scalar con¬ 
straint. This has very important consequences 
for algorithms of phase-space reduction, i.e. 
algorithms that aim to “solve” the constraints. 
It means that a reduction in steps is not pos¬ 
sible, whereby one first solves for the vector 
constraint and then seeks for solutions of the 
scalar constraint within the class of solutions 
to the vector constraint. 


3. According to (250c) two scalar constraints 
Poisson commute into a vector constraint. 
Two facts are remarkable concerning the vec¬ 
tor field that forms the argument of this vector 
constraint: First, it depends on the signature 
of spacetime (overall multiplication with e). 
Second, it depends on the phase space variable 
h through the D-operation of “index raising”; 
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explicitly: 

a{da')‘^ — a' {daY 

= {adba' - a'dba) 

This is the fact already mentioned at the end 
of Section that the constraints in GR are 
not altogether in the form of a vanishing mo¬ 
mentum map. This fact has led to some dis¬ 
cussion in the past and attempts have been 
made to consider different algebraic combina¬ 
tions of the constraints which define the same 
constraint hypersurfaces but display structure 
constants rather than structure functions in 
their Poisson brackets; e.g., [53] • But as al¬ 
ready discussed in Section]^ it is important 
that these redefinitions do not spoil the reg¬ 
ularity properties of the functions that define 
the constraint surface. 


This ends the immediate discussion of (250). But 
there is another aspect that is related to the last 
point just discussed and that deserves to be men¬ 
tioned. 


8.1 Hypersurface deformations and 
their representations 

Even though the constraints cannot be understood 
in a straightforward fashion as zero momentum 
map of a group action, they nevertheless do furnish 
a representation of an algebraic object (a groupoid) 
of hypersurface motions. As a result, the relations 


(2501 are universal, in the sense that any spacetime 


diffeomorphism invariant theory, whatever its field 
content, will give rise to the very same relations 
(2501; see |112| and [HI] for early and lucid discus¬ 
sions and [HH][Hn] for a comprehensive account. 

The idea is to regard the space of (space- 
like) embeddings Emb(E, M) of E into M as an 
infinite-dimensional manifold, on which the diffeo- 
morphism group of M acts on the left by sim¬ 
ple composition. Then there is a standard anti¬ 
homomorphism from the Lie algebra of Diff(M) 
to the Lie algebra of vector fields on £(E,M), 
just as in (219). A tangent vector at a particular 


£ G Emb(E, M) can be visualized as a vector field ^ 
on E C M with normal and tangential components, 
more precisely, as a section in the pull-back bundle 
£*TM over E. Its decomposition into normal and 
tangential components depends on £. If we think 
of M as being locally coordinatized by functions 
and E by functions a:“ then £ can be locally 
represented by four functions of three variables 
a:“. A vector field can then be represented in a 
symbolic infinite-dimensional notation 


V^= [ d^xCivix)) 


5y^^{x) 


(256) 


In full analogy to (219), this immediately leads to 


[yM = -V[Gv\- ( 257 ) 


If we now decompose ^ in an embedding dependent 
fashion into its normal component an and tangen¬ 
tial component j3 we can rewrite (2561 into 


E(a,/3) = 

J^d^x (a{x)nf^[y]{x) + /3‘'day^{x)^ 5yt{x) ’ 

(258) 


where the components of the normal n to the 
image £(E) C M have to be considered as func¬ 
tional of the embedding. Again we can compute 
the commutator explicitly. The only non-trivial 
part is the functional derivative of the with re¬ 
spect to the y''. How this is done is explained in 
the Appendix of |112j . The result is 

[E(ai,/3i),E(a2,/32)] =-V{a,P), (259a) 

where 

a = j3i{a2) - hioti), (259b) 

P = [Pi,P 2 ] + e(ai{da 2 )‘^ — 02 (^ 01 )**) . (259c) 


This is just (250) up to a relative minus sign that 
has the same origin as that between (219) and 
(223). We therefore see that (250) is a representa¬ 
tion of a general algebraic structure which derives 
from the geometry of deformations of (spacelike) 
hypersurfaces in spacetime. 
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We can now address the inverse problem, namely 
to find all Hamiltonian representations of (2591 on 
a given phase space. As in GR the phase space is 
T*Q, where Q = Riem(S), That is, we may ask for 
the most general phase-space functions : 

T*Riem(E) —>• K, parametrized by {a, (3), so that 

{HiauPi),Hia2,l32)}=Hia,/3). (260) 


The meaning of this relation is once more ex¬ 
plained in Fig.|^ It is also sometimes expressed as 
path independence^ for it implies that the Hamilto¬ 
nian flow corresponding to two different paths in 
£’(!], M) reaching the same final hypersurface will 
also result in the same physical state (phase-space 
point). 



Figure 3: An (infinitesimal) hypersurface deformation 
with parameters (ai,/3i) that maps S i—Si, followed by 
one with parameters ( 02 ,/32) that map s Si 1 — » S 12 differs 
by one with parameters (a, /3) given by l |259b[ | from that in 
which the maps with the same parameters are composed in 
the opposite order. 


To answer this question one first has to choose a 
phase space. Here we stick to the same phase space 
as in GR, that is T*Q, where Q = Riem(E). The 
representation problem can be solved under certain 
additional hypotheses concerning the geometric in¬ 
terpretation of H{a = 0,/3) and H{a,l3 = 0): 

1. H{Q,j3) should represent an infinitesimal spa¬ 
tial diffeomorphism, so that 


(261a) 


H(0,f3) to be the momentum map for the ac¬ 
tion of Diff(E) on phase space. 


2. H(a,0) should represent an infinitesimal 
Diff(M) action “normal to S”. In absence of 
M, which is not yet constructed, this phrase 
is taken to mean that (80) must hold, i.e. 

{/i, H{a, 0)} = -2eaK , (261b) 


where K is the extrinsic curvature of S in 
the ambient spacetime that is yet to be con¬ 
structed. 


It has been shown that under these conditions 
the Hamiltonian of GR, including a cosmological 
constant, provides the unique 2-parameter family 
of solutions, the parameters being k and A. See 
[8T] for more details and [96] for the most com¬ 
plete proof (see below for a small topological gap). 
This result may be seen as Hamiltonian analog 
to Lovelock’s uniqueness result [SH] for Einstein’s 
equations using spacetime covariance. 

A particular consequence of this result is the 
impossibility to change the parameter A in the 
DeWitt metric (157) to any other than the GR 
value A = 1 without violating the representation 
condition, that is, without violating covariance un¬ 
der spacetime diffeomorphisms. Such theories in¬ 
clude those of Hof ava-Lifshitz type |82| , which were 
suggested as candidates for ultraviolet completions 
of GR. 

At this point we must mention a topological sub¬ 
tlety which causes a small gap in the uniqueness 
proofs mentioned above and might have important 
consequences in Quantum Gravity. To approach 
this issue we recall from the symplectic framework 
that we can always perform a canonical transfor¬ 
mation of the form 


TT I— 7rG= TT -|- 0 , (262) 

where 0 is a closed one-form on Riem(I]). Closed¬ 
ness ensures that all Poisson brackets remain the 
same if tt is replaced with tt'. Since Riem(E) is 
an open positive convex cone in a vector space and 
hence contractible, it is immediate that Q — df 


{F,H{0,(3)} = LpF 
for any phase-space function F. 


This fixes 
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for some function / : Riem(E) —>■ K. However, tt 
and tt' must satisfy the diffeomorphism constraint, 
which is equivalent to saying that the kernel of 
TT (considered as one-form on Riem(E)) contains 
the vector helds generated by spatial diffeomor- 
phisms, which implies that 0, too, must annihilate 
all those, so that / is constant on each connected 
component of the Diff(I]) orbit in Riem(E). But 
unless these orbits are connected this does not im¬ 
ply that / is the pull back of a function on the 
quotient Riem(E)/Diff(E), as assumed in [5B]. We 
can only conclude that 0 is the pull back of a 
closed but not necessarily exact one-form on su¬ 
perspace. Hence there is an analogue of the Bohm- 
Aharonov-like ambiguity that one always encoun¬ 
ters if the configuration space is not simply con¬ 
nected. The quantum theory is then expected to 
display a sectorial structure labeled by the equiva¬ 
lence classes of unitary irreducible representations 
of the fundamental group of configuration space, 
which in analogy to Yang-Mills-type gauge theo¬ 
ries are sometimes referred to as 9-sectors [57]. In 
GR the fundamental group of configuration space is 
isomorphic to a certain mapping-class group of the 
3-manifold E. The theta-structure then depends 
on the topology of E and can range from ‘trivial’ 
to ‘very complicated’. See [73] for more details on 
the role and determination of these mapping-class 
groups and |74] for a more general discussion of the 
configuration space in GR, which, roughly speak¬ 
ing, is the quotient Riem(E)/Diff(E), often referred 
to as Wheeler’s superspace mm- 

We finally note that additional theta-structures 
may emerge if the gravitational field is formulated 
by means of different held variables including more 
mathematical degrees of freedom and more con¬ 
straints (so as to result in the same number of phys¬ 
ical degrees of freedom upon taking the quotient). 
The global structure of the additional gauge trans¬ 
formations may then add to the non-triviality of 
the fundamental group of conhguration space and 
hence to the complexity of the sectorial structure. 
Examples have been discussed in the context of 
Ashtekar variables (cf. hnal Section) in connection 
with the CP-problem in Quantum Gravity m- 


8.2 An alternative action principle 

A conceptually interesting albeit mathematically 
awkward alternative form of the action principle 
for GR was given by Baierlein, Sharp, and Wheeler 
in m- Its underlying idea, as far as the initial- 
value problem is concerned, is as follows: We have 
seen that initial data {h, K) (or (h, tt)) had to obey 
four constraints (per space point) but that the four 
functions a and /3 could be specified freely. Could 
we not let the constraints determine a and jS and 
thereby gain full freedom in specifying the initial 
data? In that case we would, for example, gain 
full control over the initial geometry, whereas, as 
we will see later, the standard conformal method 
to solve the constraints only provides control over 
the conformal equivalence class of the initial ge¬ 
ometry, the representative within that class being 
determined by the solution to the scalar constraint. 
For black-hole collision data this, e.g., means that 
we cannot initially specify the initial distances. We 
will now discuss to what extent this can indeed be 
done. At the end of this subsection we will add 
some more comments regarding the conceptual is¬ 
sues associated with this alternative formulation. 

We start from the action 


<S'GRb;f^]= [ d'^xCcR, 
Jfi 


(263) 


where Eqr is as in (230). In it we express K in 
terms of h as given in (2321. This results in 

SgrIw 13, h,h; H] = 


d'^x 


2k 1 4a 


G{h — Ljsh , h — Lph) — ei?a| . 

(264) 


For fixed domain fl C M this is to be regarded as 
functional of g, that is of a, /? and h. Note that a 
enters in an undifferentiated form. Variation with 
respect to it gives 


= a*(/i, h, 13) := 


' —eG[h — Lph , h — Lph) 

- ^ 

( 265 ) 
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where we have chosen the positive root for a and 
introduced the abbreviation a* for the function 
of h, h, and /3 defined by the expression on the 
right-hand side in ( |265[ ). Note that this only 
makes sense if R has no zeros and if the sign of 
—eG{h — Lph, h — LpK) equals that of R. Hence 
we have to restrict to a particular sign for the latter 
expression. We set 


a : = sign^G'(/i — Lph , h — Lph 
= —e sign(i?) . 


(266) 


Reinserting (|265D into 
( 266| ), gives 


taking into account 


5'bsw[;5, h, h; H] := Sonia = a* , h, h; H] = 

[ d‘^x^^J-eRG{h- Lph, h-Lph). 

Jn 2 k '' 

(267) 

Here we explicitly indicated the functional depen¬ 
dence on the time derivative of h to stress the in¬ 
dependence on the time derivative of /3. With ref¬ 
erence to m this form of the action is sometimes 
called the Baierlein-Sharp-Wheeler action. (Hence 
the subscript BSW in (267).) We can now try to 
further reduce this action so as to only depend on 
h and h. For this we would have to proceed with 
P in the same fashion as we have just done with a; 


i.e., vary (267) with respect to /? and then reinsert 


the solution h\ of the ensuing variational equa¬ 
tions back into (267). The variational equations for 


P are easily obtained (note that cr drops out): 


2k SSbsw 

\/h 


SPd 


Dr 


a^{h, h, P) 


(268) 


= 0 . 


These three equations are traditionally referred to 
as the thin-sandwich equations. They are meant to 
determine P for given pairs (/i, h). 


Suppose there is a unique solution to (268) for 
given [h, h), i.e., 


P = P^{h, h) . 


(269a) 


Inserting this into (2651 determines the lapse for 
given (h, h): 


a = a*, {h, h,P = P^ (h, h)) . 


(269b) 


Our initial goal is then achieved if we consider h S 
rT 2 S (positive definite) and h G TT^E (arbitrary) 
as freely specifiable initial data. As intended, h 
represents a freely specifiable Riemannian geome¬ 
try of E and h its initial rate of change with respect 
to some formal parameter. The relation between 
this formal time parameter and proper time is fully 


determined by the solutions (269), in the way ex¬ 


plained in Section lrT] This means that the specifi¬ 
cation of two infinitesimally nearby configurations 
h and h -\- h dx° allows to deduce the proper time 
that separates the corresponding spatial slices in 
the spacetime to be constructed. In this sense, and 


subject to the solvability of (268), physically mean¬ 


ingful durations can be deduced from two infinites¬ 
imally close instantaneous configurations. This is 
why Baierlein, Sharp, and Wheeler concluded in 
PT] that “three-dimensional geometry is a carrier 
of information about time”; or in Barbour’s conge¬ 
nial dictum ([Ml, p. 2885): “The instant is not in 
time; time is in the instant”. Also, it should now be 
obvious where the term thin-sandwich comes from 


and why the problem of finding a solution to (268) 


is often referred to as the thin-sandwich problem. 

The thin-sandwich equations (268) were first 
conceived and discussed in |116j and [21]. Exis¬ 
tence and uniqueness of solutions were originally 
conjectured (henceforth known as thin-sandwich 
conjecture) , e.g. in |116j . but first mathemati¬ 
cal investigations soon showed that the unqualified 
thin-sandwich conjecture is false; see |30| and [41]. 

To see what positive statements can be made let 
us summarize the situation: At fixed time, i.e. on 
each 3-manifold E with given Riemannian metric 
h and given parameter-time derivative h (covari¬ 
ant symmetric second-rank tensor field), equations 


(268) form a system of three quasi-linear (though 
highly non-linear) second order equations for the 
three components Pa. The restriction a > 0, where 


a is given by (265), involves P and hence implies 


an a priori bound for the unknown p. As a conse¬ 
quence one may first of all expect only local results 
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(if at all), in the sense that if /3* is a solution for 
given then there exists some open neigh¬ 

borhood U of (/i*, h*) in field space (in an appropri¬ 
ate topology) such that existence and uniqueness of 
solutions follows for all (/i, h) G U. Such a local re¬ 
sult can only be expected if the initial velocity h*, 
or any of its space-point dependent reparametriza- 
tions, is not just of the form of an infinitesimal dif- 
feomorphism In other words, (h*,/i*) must 

be chosen such that for any smooth function 77 and 
vector field ^ on S we have the following implica¬ 
tion 


ht = rj 77 = 0 and ^ = 0. (270) 


Assuming (270), local results were first proven in 


[25] and subsequently in a more geometric form and 
with generalizations, also including matter fields, 
in m- The idea of proof is to write (3 = + 6l3 

and linearize the differential operator (268) acting 


on 6p around the solution (/3*; ft,*, h*). The result¬ 
ing linear operator turns out to be symmetric (no 
surprise, being the second functional derivative of 
S'bsw in P) with a principal symbol a^k) whose de¬ 
terminant is proportional to a power of \\k\\ times 
[ 7 r(fc, ft)]^ (see eq. (3.14) in [Tl]), which vanishes 
for non-zero k iff 7 r(ft, k) = 0. Hence the linearized 
operator is elliptic iff the quadratic form tt is ei¬ 
ther positive or negative definite. Granted this. 


(270) then ensures that the elliptic operator has 


a trivial kernel. Together this allows to deduce an 
implicit-function theorem that immediately implies 
local existence and uniqueness. As regards ellip- 
ticity, we recall that from the discussion following 


(240) that a definite tt is equivalent to G{tt,tt) < 0, 


i.e. cr = — 1 (compare (266l). Hence a local ex¬ 


istence and uniqueness result holds provided that 
(ft*,ft*,/3*) satisfy a = —1, comprising the two 
equations 


eR{h.t) > 0, 

G(ft* ft* , ft* ft* , ) ^ 0 . 


(271) 


We see that we actually were not free in choosing 


either sign for a in (266): We are bound to cr = 


— 1 in order to ensure at least local existence and 


uniquness of the thin-sandwich equation (268). 


In passing we note the choice of the negative sign 
cr = — 1 has very different topological consequences 
depending on whether e = — 1 (Lorentzian space- 
time), where it implies i? < 0, or e = 1 (Rieman- 
nian/Euclidean spacetime), where it implies i? > 0. 
In the former case, given any orientable closed 3- 
manifold E, the theorem of Kazdan & Warner (see 
paragraph above Section [ 6 T] ) ensures the existence 
of a Riemannian metric ft on E with negative scalar 
curvature. In contrast, in the latter case (Euclidean 
spacetimes), the theorem of Gromov & Lawson im¬ 
plies a severe topological obstruction against Rie¬ 
mannian metrics ft on E with positive scalar cur¬ 
vature (as already discussed above: only connected 
sums of handles and lens spaces survive this ob¬ 
struction) . 


The result on local uniqueness can be generalized 
to a global argument, first given in [30] and gen¬ 
eralized in m to also include matter fields. On 
the other hand, it is not difficult to see that global 
existence cannot hold, i.e. there are more or less 


obvious data (ft, ft) for which (268) has no solution 
for /?; see, e.g., m- 

Einally we mention the conformal thin-sandwich 
approach of York’s m. which is a conceptually 
weaker but mathematically less awkward variant of 
the full thin-sandwich problem, in which only the 
conformal equivalence class of the metric and its 
time derivative is initially specified, together with 
the lapse function and the extrinsic curvature. The 
constraints are then solved for the conformal fac¬ 
tor and the shift vector field. The equation for 
the conformal factor is as in the conformal method 


discussed below (York’s equation (328)), but the 
equation for the shift is mathematically less awk¬ 


ward than in the full thin-sandwich equation (268). 


Recall that the latter is rendered complicated to 


the insertion of the solution (265) for a, which is 


precisely what is not done in the conformal variant. 
But, clearly, the price for not solving for the lapse 
is that we have again no initial control over the 
full metric. Nevertheless, the better behaved equa¬ 
tions of the conformal variant of the thin-sandwich 
method make it a useful tool in numerical investi¬ 
gations; see, e.g., [27] [771. 
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Comparison with Jacobi’s principle 

It is conceptually interesting to compare the 
Baierlein-Sharp-Wheeler action of GR with Ja¬ 
cobi’s principle in mechanics. So let us briefly recall 
Jacobi’s original idea [5Dj, where only the notation 
will be adapted. 

As in Section]^ we consider a (so far uncon¬ 
strained) mechanical system with n-dimensional 
configuration space Q. Let it be characterized by 
a Lagrangian L : TQ —K of the form 

Liq,v)=^^Ggiv,v)-Viq). (272) 


Here Gq is a positive-definite bilinear form on TqQ 
called the kinetic-energy metric . We already know 
that as L does not explicitly depend on time, any 
dynamically possible trajectory will run entirely 
within a hypersurface of constant energy E (the en¬ 
ergy function being ( 176| )). Maupertuis’ principle 
of least action, states that a dynamically possible 
trajectory a; : K D / —>■ Q, connecting fixed initial 
and final points qi and qg, extremizes the “action” 


condition expressed in terms of positions and ve¬ 
locities (measured with respect to t) introduces an 
implicit dependency of t with coordinates on TQ. 
These dependencies have to be respected by vari¬ 


ations of (2731. This is why Jacobi complained in 


his lectures that the form given above appears in¬ 
comprehensible to him{^ His crucial observation 
was that the energy condition allows to eliminate t 
altogether. Indeed, if we solve the energy condition 
for dt, 


dt = 


I Gq{dq,dq) 
2{E-V{q)) 


and use that to eliminate dt in 
the integral into the form 


(274) 


, we can put 


r ^2{E-V{q))Gq{dq,dq), (275) 

j Qi 


which is independent of any parametrization. In 
fact, it has the simple geometric interpretation of 
the length functional or the conformally rescaled 
kinetic-energy metric: 


rif 


^x{t) {^(^)) ^( 0 ) 


(273) 


relative to all other curves with the same end¬ 
points and on the same energy hypersurface. We 
note in passing that since xdt = dq and G{v,-) = 


dL{q,v)/dv = p, the integrals in (273) equal the 


integrals of the canonical one-form 9 (compare 
(196a)) along the paths q = x{t) and p = G[x{t), •) 
in T*Q with fixed endpoints of the curve pro¬ 
jected into Q and the curves all running on a hy¬ 
persurface of fixed value of the Hamiltonian func¬ 
tion. This is the form the principle of least action 
is given in modern formulations, like in Arnold’s 
book [5] (Chapter 9, Section 45 D). In this (modern, 
Hamiltonian) form, time plays no role. Indeed, on 
phase space T*Q the integral of the one-form 6, as 
well as the level sets for the Hamiltonian function, 
are defined without reference to any time param¬ 
eter t. But in the traditional (19th century, La¬ 
grangian) form stated above, the parameter enters 
in an essential way. In fact, in this formulation t 
is not an independent variable because the energy 


G = 2{E-V)G, 


(276) 


where E is a, constant. Jacobi’s principle then says 
that the dynamically possible trajectories of energy 
E are the geodesics of G, and that (Newtonian) 
time along such a geodesic is obtained by integrat¬ 


ing (274) along it. Note that Jacobi’s principle 
defines in fact two new metrics on configuration 
space Q, both of which are conformally equivalent 


to the kinetic-energy metric G. The first is (276) 


which determines the trajectory in Q, the other is 


G = 


G 


2{E-V) 


(277) 


which determines Newtonian time along the trajec¬ 
tories selected by the first. We call it the first and 
second Jacobi metric respectively. Note also that 


(274) gives a measure for duration, dt, in terms of 


changes of mechanical coordinates. 


^ “Dieses Prinzip wird fast in alien Lehrbiichern, auch 
den besten, in denen von Poisson, Lagrange und Laplace, so 
dargestellt, dass es nach meiner Ansicht nicht zu verstehen 
ist.” ([90], p.44) 
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In passing we remark that a special realization 
of this far reaching idea, namely to read off New¬ 
tonian time from simultaneous configurations (i.e., 
generalized positions) of mechanical systems, as¬ 
suming the systems to obey Newtonian laws of 
motion, is emphemeris (or astronomical) time in 
astronomy 133 where the relative configura¬ 
tions (ephemerides) of the Moon, Sun, and plan¬ 
ets (as seen from the Earth) are used as positions. 
Ephemeris time was first proposed as time standard 
in 1948 [in] in order to establish a reference with 
respect to which non uniformities in the Earth’s 
daily rotation could be accounted for, though the 
idea goes back at least to 1929 [48]. Ephemeris 
time ideed became the time standard in 1952 until 
the 1970s, when atomic time took over its place, 
though we remark that “atomic time” is really just 
based on a straightforward generalization of the 
very same principle to Quantum Mechanics. Here, 
again, one reads off time from simultaneously mea¬ 
surable states or observables of one or more systems 
obeying known deterministic “laws of motion”, like 
Schrodinger’s equation for states or Heisenberg’s 
equation for observables. These equations of mo¬ 
tion correlate the a priori unobservable parameter 
t with observable properties of the system, which 
render t observable in a context dependent fash¬ 
ion through inverting these relations (solving the 
equations of motion for t). 


to closely correspond to each other. Equation 


Now, coming back to gravity, (2751 should be 


compared to (2671. Except for the terms involv¬ 


ing the lapse-function (3 the latter is (almost!) like 
(2751 for if = 0 and V = eR. The analogy is 


incomplete (hence “almost”) because in (267) the 
spatial integration is outside the square-root, so 
that the integrand for the parameter-integration 
along the curve in configuration space is a sum 
of square-roots rather than a single square-root 
of a sum. This difference renders the expression 


in (2671 different from usual “length functionals”. 


Note that sums of square roots (involving them¬ 
selves sums of squares) generally do not even form 
a Finsler metric (compare the discussion in [7T|1. 

Taking the analogy further, (2741 should be com¬ 
pared to ( 269b[ ) with a* given by the integral (265). 
Again, except for the terms involving j3, they seem 


(269b) determines one proper time per spatially 


fixed (with respect to the spatial coordinates) ob¬ 
server in the spacetime to be developed from the 
initial data. Hence there is something like a contin¬ 
uum of second Jacobi metrics, one for each space 
point. 

An interesting observation in this connection is 
the following [40], which we give in a simplified 
form. Suppose that we tried to define, from first 
principles, duration by some measure of change 
in the gravitational degrees of freedom, i.e., some 
kind of gravitational ephemeris time . In analogy 
to (274) We assume the “measure of change”, dr, 


to be given by some local rescaling of a pseudo- 
Riemannian distance measure 


ds^ = 

f d^x G' 


Jn 

so that 

dr 


ds^ 


fs ' 


(279) 


Here R must be a scalar function of the spatial 
metric h. The simplest non-constant such function 
is the scalar curvature, which depends on h and 
its derivatives up to second order. A priori such 
a measure of duration seems to depend equally on 
all gravitational degrees of freedom at all points 
in space, thus giving rise to a highly non-local 
concept of time with respect to which durations 
of processes, even local ones, can be measured. 
However, suppose we required that the measure 
of time be compatible with arbitrarily fine local¬ 
ization E —>■ 17 C E. Following [ID] we call this 
the chronos principle. It implies that the numera¬ 


tor and denominator of (279) are proportional for 
each restriction E —17. This is only possible if 
the integrands are proportional. Without loss of 
generality we can take this constant of proportion¬ 
ality (which cannot be zero) to be 1 (this just fixes 
the overall scale of physical time) and obtain (here 
written without cosmological constant for simplic¬ 
ity) 




dho^^i^x^ dhcd(.^^ 


dr 


dr 


-R[h]{x) = 0. (280) 
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which is just the Hamiltonian constraint. The du¬ 
ration in time is then given by 


AT{hi,hf) 



lG{dh/dX,dh/dX) 

-R[hW] 


dX, (281) 


which is obviously just the analog of the integrated 
version of (2741 for i? = 0 (which here amounts 
to the Hamiltonian constraint), i.e. the integrated 
version of ( 265| ) for /3 = 0 (this is the simplification 
we alluded to above) and e = — 1 . 


9 Asymptotic flatness and 
global charges 

Isolated systems are described by geometries which 
at large spatial distances approach a matter-free 
spacetime. In case of vanishing cosmological con¬ 
stant the latter will be flat Minkowski spacetime. 
For non-zero A it will either be de Sitter (A > 0) or 
anti-de Sitter (A < 0) space. Here we are interested 
in the case A = 0. We refer to the survey [5D] for 
a recent discussion of the anti-de Sitter case. 

An initial data set {h, tt) or (/i, K) on E needs 
to satisfy certain asymptotic conditions in order to 
give rise to an asymptotically flat spacetime. Be¬ 
fore going into this, we point out that there is also a 
topological condition on S in order to sensibly talk 
about asymptotic regions. The condition is that 
there exists a compact set AT C E such that its 
complement E — iF is diffeomorphic to the disjoint 
union of manifolds — where H is a closed ball. 
These pieces in which E decomposes if one cuts out 
increasingly large compact sets are called ends of 
E. In passing we note that the theory of ‘ends’ 
for topological spaces and groups was developed 
by Freudenthal in 1931 m- Now, the first condi¬ 
tion we pose is that there is only a finite number 
of such ends. (It is not hard to visualize manifolds 
with even an uncountable number of ends.) With 
respect to each end we can talk of approaching in¬ 
finity. This means to let r —>■ oo if r is the standard 
radial coordinate on — B to which this end is 
diffeomorphic. 


A first working definition of asymptotically flat 
initial data in Hamiltonian GR was given in 1974 
by Regge & Teiltelboim |105j . It was shown by 
Beig & 6 Murchadha in 1986 [53] that this defini¬ 
tion is sufficient to allow the implementation of the 
10-parameter Poincare group as asymptotic sym¬ 
metries giving rise to 10 corresponding conserved 
quantities. The definition can be given as follows 
(here we restrict to one end): 

Definition (Regge-Teitelboim asymptotic flat¬ 
ness). Let T, be a 3-manifold with one end. An 
initial data set {h, tt) on E is asymptotically flat in 
the sense of Regge- Teitelboim if there is a coordi¬ 
nate system {x^, , x^} covering the end, such that 

as r := \/x°‘x^5ab oo) 

habix) = 6ab + 02 {r ~‘^), (282a) 

r 

fah( \ 

^“'’(x) = ^:^+Oi(r-3), (282b) 

where x = {x^,x^,x^) and v = , v^) with 

:= x°‘ jr. Ok{r~'^) denotes terms falling off like 
1 /r” and whose l-th derivatives fall off like 
for 0 < I < k. Moreover, Sab xnd t°'^ obey the 
parity conditions 

Sab{-v) = Sab{v) , (283a) 

t“'>(-j.) = -e\v ). (283b) 


The first thing to observe is that these conditions 
suffice to make the integral (251) for the symplec- 
tic potential convergent. Note that ( 282| ) merely 
implies that the integrand falls off like l/r^, which 
could still produce a logarithmic divergence. But 
(283) implies that the l/r^ integrand is of odd par¬ 
ity and hence gives no contribution. Next we have a 
look at the constraint functionals (242). In (242a) 
the first integrand has a l/r'^ and the second a 
1/r^ parity-even fall-off. In (242b) the integrand 
has also a 1/r^ parity-even fall-off. Hence the in¬ 
tegrals (242) certainly converge for those lapse and 
shift fields a, f which asymptotically either tend 
to zero or approach direction-dependent constants 
in a parity-odd fashion. As we will see below, the 
constraints for such parameter fields a and f are 
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differentiable with respect to the canonical vari¬ 
ables h and tt and hence generate a Hamiltonian 
flow that has to be considered as gauge transfor¬ 
mations; compare Section[7!Tj Hence we set 


a(a^)gauge = a{v) + 02 {r 
/3“(a:)gauge = b°‘{v) -f Oi(r~^ 




where 


a{—v) = —a(v ), 
b^{-u) = -b^{v). 


(284a) 

(284b) 


(285a) 

(285b) 


To see that Cs(a)[/i, tt] and C^(/3)[/i, tt] as defined 
in (242a) and (242b) are functionally differentiable 


with respect to h and tt we make the following ob¬ 


servations: For (242a I the only boundary term one 


might pick up is that from the variation of the 
scalar curvature with respect to h, which follows 


from (117) (in which formula we have to replace 
gab with hab and hab with Shah in order to match 
the notation here). It reads 


^ f d^laVhl^aG'^^'^^'DbShad 

2k Jg2 


(286) 


and is thus seen to have an integrand that has 1/r^ 
fall-off and is parity-odd. Hence the integral van¬ 
ishes. Note that here and in what follows we used 
the following shorthand notation 


'SI, 


dn{' 


lim ■ 

1—yoo 


/52(r) 


dflri---) > , (287) 


where S'^{r) is the sphere of constant “radius” r (as 
defined above) and dflr its induced volume form. 

The vector constraint ( 242b| ) contains tt as well 
as h in differentiated form (the latter in D), so that 
boundary terms may appear in both variation, that 
with respect to tt as well as that with respect to 
h. For both cases it is convenient to rewrite the 


integral (242b) by performing a partial integration 


before variation: 


a(/3) : = f d^x /3“ 

Jn 


‘Zch(ii)]D qTt 


he 


= C [ d?'x{Lph)abT^°'^ 


-2c dni3^habhcdl^‘^TT^G 

Jst, 


( 288 ) 


Under the fall-offs and parity conditions mentioned 
above the last (surface) integral is zero since its 
integrand has 1/r^ decay and is parity-odd. Hence 
variation with respect to tt does not lead to surface 
terms. Variation with respect to h is now simply 
given by varying the h under the Lie differentiation 
L/d which itself has no dependency on h (unlike 
the covariant derivative D). Using Lp{6habT^°'^) = 
Dc{fd^6hab'^‘^’’), partial differentiation with respect 
to the Lie derivative then gives the surface term 


2 c / dH habv°‘p ShedTr' 


cd 




(289) 


the integrand of which again falls off like 1/r^ and 
is parity odd. 

The considerations so far show that the con¬ 
straints Cs{a)[h,n] and Cy{l3)[h,Tr] are differen¬ 
tiable with respect to h and tt whenever (h, tt) sat¬ 


isfy (282) and (2831 and the parameter-functions 


(a,/?) satisfy (284) and (285). From the consider¬ 


ations it also follows that we cannot improve on 
the latter two conditions given the fall-offs and 
parity conditions on (/i,7r). This characterizes the 
lapse and shift functions which generate pure gauge 
transformations in Hamiltonian gravity for asymp¬ 


totically states. We stress that (284) and (285) in¬ 


cludes motions that do not vanish at infinity. These 
are called supertranslations. Without their care¬ 
ful inclusion into the transformations considered 
as gauge, we would not obtain the Poincare group 
as proper physical symmetry group but rather an 
infinite-dimensional extension thereof. For more 
discussion on this conceptually important point 
compare the discussion in |69) . 

Motions characterized by functions (a,/3) out¬ 
side the class (284) and (285) do change the phys¬ 
ical state. If this motion is to be generated by 
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the Hamiltonian (241) we must restrict to those 


(a, /3) for which suitable boundary terms can be 
found such that i/(a, tt] is differentiable with 
respect to h and tt obeying (282) and (283). To ac¬ 


commodate asymptotic Poincare transformations 
we must worry about asymptotic translations in 
time and space directions, asymptotic rotations, 
and finally asymptotic boosts, all of which we 
only need to specify modulo gauge transformations. 
Asymptotic time translations correspond to con¬ 
stant a. The surface term that results from the 


variation of Cs{a) is just (286). It immediately fol¬ 
lows that the term that has to be added to Cs{a) 
so as to just cancel this surface term upon variation 
with respect to h is just aE^oM, where 

Aadm = Madmc^ 

— &(2/tj / dH (^dahab db^aa'jb^b 5 

(290) 

is called the ADM energy and Madm the ADM 
mass. Note that we just replaced all non dif¬ 


ferentiated hab that appear in (286) by 6ab since 
the difference does not contribute to the surface 
integral in the limit as r —>■ oo. Similarly, asymp¬ 
totic space translations corresponds to covariantly 
constant /3, i.e. constant components /3“ with re¬ 
spect to the preferred coordinates that served to 
define asymptotic flatness. Again we immediately 


read off (288) the boundary term that we need to 


add in order to cancel that in the last line of (288) 


upon variation of tt. It can be written in the form 
cPadm(/ 3), where 

Padm(/3) = 2/ dQ^‘^TTabiy^ . (291) 

This we call the ADM general momentum. Note 
that the integrand has fall-off 1/r^ and even parity 
and hence gives a finite contribution. Furthermore, 


it follows from (289) that the variation with respect 


to h does not give rise to a boundary term since the 
integrand in (289) has 1/r^ fall off and hence does 


not contribute, independently of its (even) parity. 

Asymptotic rotations and boosts are a priori 
more delicate since now a and /3 are allowed to 


grow linearly with r. We have, up to gauge trans¬ 
formations, a oc UaX'^ for a boost in it-direction 
and pa oc Sabc^xfix^ for a rotation around the Cu 
axis. Here Sabc are the components of the metric 
volume-form for the asymptotic metric <5 with re¬ 
spect to the coordinates {a;“}, so that Sabc = ±1, 
depending on whether {abc) is an even (-)-) or odd 
(—) permutation of (123). As indices are raised 
and lowered with respect to 5, we need not be con¬ 
cerned whether they are upper or lower (as long as 
we work in components with respect to {x°'}). The 
components Ua and are then again constants. 

We start with rotations and read off the last line 


of (288) that the surface integral has an integrand 


oc TT^Pa^b^ which looks dangerous as its naive fall- 
off is 1/r. However, if we use that tt actually sat¬ 
isfies the constraint, = 0, we can convert 

this surface integral to a bulk integral whose inte¬ 
grand is proportional to Db('K°'^Pa) = 7r“*’H(a/3b). 
But D^^aPb) = d(aPb) - and P is Killing with 

respect to the metric 5ab, so that d(^aPb) = 0- Now, 
the Christoffel symbols T^j, for the metric h fall off 
as l/r^ with odd parity, so that T^^/Jc falls off as 
Hence TT°‘^D{^aPb) falls off like 


1 jr with even parity. 

1 /r^ with odd parity, showing that this volume in¬ 
tegral also converges (a logarithmic divergence be¬ 
ing just avoided by odd parity). Finally we observe 
that for asymptotic rotations there is still no sur¬ 


face term of the form (289), since its integrand has 
1 /r^ fall-off and is of odd parity. As a result we 
have that even for asymptotic rotations we obtain 


the same formula (291) for the (linear or angular) 
momentum as long as the Regge-Teitelboim condi¬ 


tions (282) and (283) are satisfied and {h, tt) satisfy 
the constraints. In components with respect to the 
asymptotic coordinates the components of the lin¬ 
ear momentum are 


^ADM “ ^ ’ 

Jst. 


and for the angular momentum 


"^ADM = 2 / dn SabcX^TT'^’^Vd , 


'SI. 


(292) 


(293) 


where Sabc is as above. 
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Turning now to the boosts, we have 

Oi — Ua^ CTgauge ■ (294) 

We need to repeat the same steps that previously 


led us to (2861. Now, according to (117) the space 


integral over the divergence term in the variation 
of the scalar curvature is 

X := f (295) 

Jy, 

A first integration by parts leads to 


X = 




'S'L 


(296) 


- / dPxVhG^^^'^{Daa)Dt,5had- 


One more integration by parts of the second term 
gives 


X = 


'St. 


dnaVhi^aG‘^^‘^‘^Dbdhcd 


- [ dnVh{Daa)iybG^^‘^^6hcd (297) 
->si^ 

+ [ d^xVhG‘^^^‘^{D^Dba)6h^d- 

Jy 


Equation (294) implies that D^a has 1/r^ fall- 
off with odd parity. Hence the last (volume) in¬ 
tegral in (297) has 1/r^ fall-off with odd par¬ 
ity and hence converges. It gives rise to a term 
oc \/liG°‘^'^'^DaDba in the Hamiltonian equation for 
According to the general strategy the surface 
integrals must be taken care of by adding suitable 


surface integrals to the right-hand side of (241) so 


as to just cancel s/{2k) times the integrals just 
found as resulting from the variation of the scalar 


curvature in (242). Hence the right surface terms 
to be added to (241) are of the form MqX“, where 


^ Iy dVLx^^ (dbKc - dcKb) Vc 

y* dH {{hab ^ah)^b i^bb ^bb)^a) ^ ■ 

(298) 


The coordinates Z°' of the center of mass are then 


defined by the rescaled forms of (298), with rescal¬ 
ing factor Aadm: 


:= 


X" 


EIadm 


(299) 


In order to arrive at (298) we wrote Shab = S{hab — 


Sab) and left the difference {hab — Sab) rather than 
just hab under the integral in order to not keep 
the asymptotically constant term of (282a) under 


the integral when pulling the variation S outside it 
(cf. [55] AppendixC). 

It has been shown in |49| that the expression 
( |298 ) for the (unsealed) center of mass coincides 
with the geometric definition of Huisken and Yau’s 
|M] . The latter is defined by means of mean- 
curvature foliations of E. Its relation to alterna¬ 
tive definitions, including not only ADM but also 
a definition due to R. Schoen, using asymptotically 
conformal Killing fields, is discussed and lucidly 
summarized in [83] . 

So far we have been working with the particular 


asymptotic conditions (282) and (283). We have 


been arguing for existence of certain quantities to 
be identified with physical quantities of energy, lin¬ 
ear and angular momentum, and center of mass. 
But what about uniqueness? All these quantities 
depend a priori on the choice of the asymptotic 
coordinates within the set of all coordinates satis¬ 
fying the given fall-off conditions. Hence one needs 
to prove that this dependence is actually spurious 
and that, consequently, these quantities are geo¬ 
metric invariants. For the ADM mass and linear 
momentum this has been shown in |25] and [43] . 
Moreover, ignoring angular momentum and cen¬ 
ter of mass, these proofs were given under much 
weaker asymptotic conditions, in fact the weakest 
possible ones. Regarding the latter, we recall that 
it was shown in m by means of explicit coordinate 
transformations that the expression can be made 
change its value if \hab — Sab\ < with a < 1/2; 
see also the lucid discussion in (SD). Hence we cer¬ 
tainly need a > 1/2. That this indeed suffices to 
prove existence and uniqueness was established in 
[17)211251143]. This fits nicely with recent general¬ 
izations of stability results of Minkowski space by 
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Bieri m, which work under the following asym- 
totic decay conditions, where a > 1/2: 

hab = Sab + 02 ( 7 ’““), (300a) 

= (300b) 


These conditions suffice to establish ADM energy 
and linear momentum not only as being well de¬ 
fined, but also as being preserved under Hamilto¬ 
nian evolution. 


At first sight (300a I might seem too weak to 


guarantee existence of (290). The reason why it 


is not is, in fact, easy to see: If we convert (290) 


into a bulk integral using Gauss’ theorem, the in¬ 
tegrand contains a combination of 2nd derivatives 
of h which just form the 2nd derivative part of 
the scalar curvature. Using the scalar constraint, 
which schematically has the form 


+d'^h+{dhf = 0, 


(301) 


simple and coordinate invariant expression is 


MKomar = ^ / ■ (302) 


Here K is the timelike Killing vector field so nor¬ 
malized that lim,r_>oo g{K, K) = e. There exist 
various proofs in the literature showing MKomar = 
Madm; see, e.g., [1H][1D][13] and Theorem 4.13 of 

m- 


Since the Komar mass is frequently used in ap¬ 
plications, let us say a few things about it. From 
a mathematical point of view the main merit of 
(302) is that it allows to associate a “mass” to any 
2-dimensional submanifold S € M, independently 
of any choice of coordinates. We call it the Komar 
mass of S: 


MKomar(^) = ^ / *dK'’ . (303) 

Js 


this can be written as a bulk integral containing 
only integrands of the form cx tt^ and cx (dh)^, 
which according to (300a I fall off like 
i.e., faster than 1/r^. Hence the bulk integral con¬ 
verges. But note that the conditions (300) do not 
suffice to ensure the existence of conserved quan¬ 
tities regarding angular momentum or center of 
mass. Alternative conditions to (282) and (283) 
for the existence of angular momentum have been 
discussed in [5D] and [S]. 

We recall that even in the context of the Regge- 
Teitelboim conditions ( |282 ) and (283) we needed to 
invoke the fact that (h, tt) satisfy the constraints in 
order to conclude sufficiently strong fall-offs. This 
we have already seen explicitly in the discussion 
on, e.g., the existence of angular momentum in the 
paragraph above equation (292). From the scalar 
constraint (301) we now learn that (282) and (283) 
implies a l/c fall-off for G'^’^‘^‘^dadbhcd, and not 
just 1/r^ as naively anticipated from ( |282[ ). 

Alternative expressions for mass/energy exist in 
cases of symmetries. For example, for asymptot¬ 
ically flat and stationary solutions to Einstein’s 
equations, the ADM mass Madm is known to co¬ 
incide with the so-called Komar mass |94|, whose 


For S —)■ S'^ its interpretation is that of the ADM 
mass, whose physical significance as the value of the 
Hamiltonian (divided by c^) endows it with a sound 
physical interpretation. But what might the inter¬ 
pretation be for general S7 Well, suppose S = dB, 
where H is a 3-dimensional spacelike submanifold 
of M. Then, by Stokes’ theorem: 


MKomar (S') 



(304) 


Here we used the general identity for the square of 
the Hodge star restricted to p-forms in n dimen¬ 
sions, 

* ° * I A.(M) = ^ (-1)"^”-^^ IdAP(M) , (305) 


and also the general formula that allows to express 
in terms of the covariant divergence V • on the 
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first index with respect to the Levi-Civita connec¬ 
tion, 


* od o * 


Ap(M) 


= e(-l) 


hp-i-i) 


(306) 


In the final step of (304) we used that any Killing 


vector-field satisfies the identity (again for the 
Levi-Civita connection): 


VaVfcKe = 


abc 


Kd. 


(307) 


If the spacetime satisfies Einstein’s equation we can 
use ([^ t o eliminate the Ricci tensor in the last line 
of (3031 in favor of the energy-momentum tensor. 


This shows that if S has two connected components 
Si and S 2 , and if T|b = 0 , then (choosing the 
relative orientations of S'! and S 2 appropriately) 
iWKomar(S'i) = A7Komar(S'2)M. For a finite-size Star 
or a black hole this means that we may take any S 
to calculate the Komar- and hence the ADM mass, 
as long as S' U bounds a 3-dimensional region 
B on which T vanishes. In particular, S may be 
taken as any 2-sphere outside the star’s surface. 
More specifically, consider a static star where K 
is the hypersurface orthogonal Killing vector-field. 
The topology of the hypersurfaces orthogonal to 
K inside the star shall be just that of a ball in 
then we express the star’s ADM energy by the 
Komar integral over the star’s surface S and that, 


in turn, by the bulk integral (304) over the star’s 


interior, where we replace the Ricci tensor by T. 
This results in the so-called Tolman mass (see m 
and § 92 of |115j l. which in our notation reads: 

MToi-ma.n = ^ X ^/dei(h) X 

db (308) 

iJeg{K,K) [T(n, n) - eTr,,(T) . 

Here n := K/yjeg{K, K) is the normal to the hy¬ 
persurfaces and Tr/i the trace with respect to the 
spatial metric h, where, we recall, g = e v}’ ’Si'n)’ + h. 
In the Lorentzian case (e = —I) we see that 
the first r(n, n)-term in (308) is just the integral 
over the spatial energy-density of the matter di¬ 
vided by and weighted by the redshift factor 
i/—g{K,K). The additional term is absent if the 


pressures are negligible compared to the energy 
density, but this need not be the case. For ex¬ 
ample, if T is that of an electromagnetic field, we 
have Trg(T) = eT{n,n) + Tr^(T) = 0, so that 
the pressure effectively doubles the contribution 
of the first term to the overall mass. This is the 
origin of the infamous “factor-2-anomaly” of the 
Komar mass, which, e.g., leads to the result that 
the difference between two Komar masses evalu¬ 
ated on two different 2-spheres of spherical sym¬ 
metry in the Reissner-Nordstrpm manifold (elec¬ 
trically charged black-hole) gives twice the elec¬ 
trostatic field energy stored in the region between 
the spheres. On the other hand, for a spherically 
symmetric perfect-fluid star, Tolman has shown 
in m that in a weak-field approximation the 
leading-order difference between (308) and the in¬ 
tegrated mass-density of the fluid is just the New¬ 
tonian binding energy, which makes perfect sense. 

At this point we should mention the positive- 
mass theorem (for Lorentzian signature e = —1), 
which states that for any pair (h, tt) of initial data 
satisfying the constraints Madm > 0, with equal¬ 
ity only if the data are that of Minkowski space. 
Note that the expression (290) for Madm is a func¬ 
tional of h alone, but that in the formulation of the 
positive-mass theorem given here it is crucial that 
for h there exists a tt so that the pair {h, tt) solves 
the constraint. Otherwise it is easy to write down 
3-metrics with negative ADM mass; take e.g. (314) 
(see below) with tq replaced by —Tq, where tq > 0, 
suitably smoothed out for smaller radii so as to 
avoid the singularity at r = rg. Since the ADM 
mass only depends on the asymptotic behavior it 
is completely independent of any alterations to the 
metric in the interior. If one wishes to make the 
positive mass theorem a statement about metrics 
alone without any reference to the constraints one 
has to impose positivity conditions on the scalar 
curvature. But that also imposes topological re¬ 
strictions due to the result of Gromov and Lawson 
[75] mentioned at the end of Section|^ For a recent 
up-to-date survey on the positive-mass theorem we 
refer to m- 

Note that Madm = MKomar implies the posi¬ 
tivity of MKomar = Mxomar (-S*^) • But this doeS 
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not imply that -MKomar(5'^) is also positive for 
general S^. In fact, explicit examples of regular, 
asymptotically flat spacetimes with matter satisfy¬ 
ing the hypotheses of the positive-mass theorem are 
known in which MKomar(<S'^) < 0 for suitably cho¬ 
sen 2-spheres [T]. The recipe here is to regard two 
concentric counter-rotating objects in an axially- 
symmetric and stationary spacetime, e.g., an out¬ 
side perfect-fluid ring and an inner rigidly rotating 
disk of dust. The Komar mass of the disk (i.e. 
encloses the disk but not the ring) may then turn 
out negative if the frame-dragging effect of the ring 
is large enough so as to let the angular velocity and 
the (Komar) angular momentum of the central ob¬ 
ject have opposite signs. 

In passing we remark that the positive mass the¬ 
orem in combination with the equality Madm = 
fWKomar gives a simple proof of the absence of 
“gravitational solitons”, i.e. stationary asymp¬ 
totically flat solutions to Einstein’s equations on 
E = K^. This follows from (302) and d-kdK^ oc 
*iA-R.ic. The vaccum equation Ric = 0 then im¬ 
plies Madm = MKomar = 0 which implies that 
spacetime is flat Minkowski. This theorem was 
originally shown for static spacetimes (i.e. hyper¬ 
surface orthogonal K) by Pauli and Einstein [55] 
and later generalized to the stationary case by Lich- 
nerowicz m- The result of this theorem cannot be 
circumvented by trying more complicated topolo¬ 
gies for E. As soon as E becomes non-simply con¬ 
nected (which in view of the validity of the Poincare 
conjecture will be the case for any one-ended man¬ 
ifold other than K^) we know from Gannon’s theo¬ 
rem [63] that the evolving spacetime will inevitably 
develop singularities. 

Finally we mention that under suitable fall- 
off conditions we can hnd the Poincare group as 
asymptotic symmetry group [551. It will emerge 
from (260) as equivalence classes of all hypersur¬ 
face deformations, including those in which a and 
P asymptotically approach rigid translations, ro¬ 
tations, or boosts. The quotient is taken with re¬ 
spect to those deformations which are generated by 
the constraints, in which a and /3 tend to zero at 
spatial infinity. There are various subtleties and 
fine tunings involved for the precise fall-off condi¬ 


tions that are necessary in order to exactly obtain 
a 10-dimensional symmetry as a quotient of two 
infinite-dimensional objects. This is particularly 
true for asymptotic boosts, for which one needs to 
tilt the hypersurface, corresponding to asymptotic 
lapse functions a oc r. (Boosted hypersurfaces are 
known to exist in the development of asymptoti¬ 
cally flat initial data |42|.) But leaving the ana¬ 
lytic details aside, the qualitative picture is quite 
generic for gauge field theories with long-ranging 
field configurations [51] : A proper physical symme¬ 
try group arises as quotient of a general covariance 
group with respect to a proper normal subgroup, 
the latter being defined to be that object that is 
generated by the constraints. 


10 Black-Hole data 


In this section we discuss some simple solutions to 
the vacuum Einstein equations without cosmologi¬ 
cal constant. We first specify to the simplest case 
of time symmetric conformally flat data. Time 
symmetry means that the initial extrinsic curva¬ 
ture vanishes, AT = 0. The corresponding Cauchy 
surface will then be totally geodesic in the space- 
time that emerges from it. The vector constraint 


(144b) is identically satisfied and the scalar con¬ 


straint (144a) reduces to scalar flatness 


R{h) = ScaW = 0 . 
Conformal flatness means that 
h = Vt‘^ 5, 


(309) 


(310) 


where 5 is the flat metric. From (|125a| we infer 

(311) 


that (309) is equivalent to 17 being harmonic 
Aill = 0 


where is the Laplacian with respect to the flat 
metric 5. We seek solutions 17 which are asymp¬ 
totically flat for r —>■ oo and give rise to complete 
manifolds in the metric structure defined by g. The 
only spherically symmetric such solution is 

17(r) = 1 + ^ , (312) 
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where the integration constant ro can be related to 


the ADM mass (2901 by 

Madm = 2c^rojG. 


(313) 


This solution is defined on E = — {0}. The 

metric on E so obtained is 

h= + {dr^+r‘^id0^+sm^0dip^)) . (314) 

It admits the following isometries 

li{r,0,ip) := {rl/r,0,ip ), 


l2[r,0,tp) := (ro/r,7r- 6»,(^ + 7r). 


(315a) 

(315b) 


Note that the second is just a composition of the 
first with the antipodal map (r, 0, (p) i—)■ (r, tt — 
0,ip + tt) which is well defined on — {0}. This 
makes I 2 a fixed-point free action. The fixed-point 
set of Ii is the 2-sphere r = tq. Note that gener¬ 
ally a submanifold that is the fixed-point set of an 
isometry is necessarily totally geodesic (has van¬ 
ishing extrinsic curvature). To see this, consider 
a geodesic that starts on and tangentially to this 
submanifold. Such a geodesic cannot leave the sub¬ 
manifold, for if it did we could use the isometry to 
map it to a different geodesic with identical initial 
conditions, in contradiction to the uniqueness of 
solutions for the geodesic equation. Hence the 2- 
sphere r = tq has vanishing extrinsic curvature and 
is therefore, in particular, a minimal surface (has 
vanishing trace of the extrinsic curvature). The ge¬ 
ometry inside the sphere r = rp is isometric to that 
outside it. This is depicted in Fig.|^ 

For the data {h = (314), AT = 0) on E = —{0} 


we actually know its maximal time evolution: It 
is the Kruskal spacetime inaizi] which maximally 
extends the exterior Schwarzschild spacetime. Fig- 
ure[^ shows a conformal diagram of Kruskal space- 
time. 

In Kruskal coordinates (Kruskal [53] uses {v,u), 
Hawking Ellis [75] for what we call (T,X)) 

(T, X, 0, ip), where T and X each range in (— 00 , 00 ) 
obeying — X^ < 1, the Kruskal metric reads (as 
usual, we write dfl"^ for d0^ + sin^ 0 dp^): 

g = exp(—r/ro) {—dT^ -I- dX^) + , 

( 316 ) 



Figure 4: Cauchy surface with time symmetric initial 

data and two isometric asymptotically flat ends separated 
by a totally geodesic 2-sphere. 


where r is a function of T and X, implicitly defined 
by 

((r/ro) - l) exp(r/ro) = . (317) 


The metric is spherically symmetric and allows for 
the additional Killing field 


K = {Xdr + Tdx) 


(318) 


which is timelike for |A| > |r| and spacelike for 
|A| < |T|. 

Both maps (315) extend to the Kruskal manifold. 
The fixed-point free action (315b) has the extension 

J : (T, A, 0, p) ^ (T, -A, tt-0,p + t:). (319) 


It generates a freely acting group Z 2 of smooth 
isometries which preserve space- as well as time- 
orientation. Hence the quotient is a smooth space- 
and time-orientable manifold, that is sometimes 
called the -geon. It represents the maximal 
time evolution of the data {h = (314), A = 0) 
as above, but now defined on the initial quotient 
manifold E = (K^ — {0})//2. It has only one 
asymptotically flat end and the topology of a once 
punctured real projective space KP^. Note that 
the map J preserves the Killing field (318) only 
up to sign. Had one chosen J' : (T, A, 0, p) 

(—T, —A, Ti — 0,p tt) as in m and |53|, one 
would have preserved K but lost time orientabil- 
ity. 
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*+ black-hole singularity *+ 



Figure 5: Conformally compactified Kruskal spacetime. 
The T axis points up vertically, the X axis horizontally to 
the right. The Cauchy surface of Fig. [^corresponds trio the 
hypersurface T = 0. The various infinities are: io spacelike, 
i± future/past timelike, and I± future/past lightlike infin¬ 
ity. The right diamond-shaped region corresponds to the 
usual exterior Schwarzschild solution containing one asymp¬ 
totically flat end. 


Within the set of conformally-flat and time sym¬ 
metric initial data we can easily generalize the so¬ 
lution (312) to (311) to include more than one 
monopole term on a multi-punctured For two 
terms we get 

r,/ \ _ 1 I I “2 

ilfr) — 1 — -)- — , 

n r2 


(320) 


where = ||;? — c)||. This represents two black 
holes without spin and orbital angular momentum 
momentarily at rest, with c) G representing the 
hole’s “positions”. The manifold has three ends, 
one for r —>■ oo and one each for —>■ 0. For each 
end we can calculate the ADM mass and get 


M =2{ai+a2)c^/G, (321a) 

Ml = 2 (^ai + ^ , (321b) 

M 2 = 2 (^02 + ^ , (321c) 


where ri 2 := ||ci — C 2 ||. Here M is the total mass 
associated with the end r oo and Mi is the in¬ 
dividual hole mass associated with the end Vi -G 0. 
The binding energy is the overall energy minus the 
individual ones. One obtains 

AE:=(M-Mi-M2y = + (322) 

ri2 


where the dots stand for corrections of quadratic 
and higher powers in GMi/c^ri 2 . This can be eas¬ 
ily generalized to any finite number of poles. Note 
that the initial manifolds are all complete, i.e. all 
punctures lie at infinite metric distance from any 
interior point. 

Other generalizations consist in adding linear 
and angular momentum. This can be done using 
the conformal method, which we now briefly de¬ 
scribe. Recall that we wish to solve the constraints 
(144) for T = 0 but now with K 0. Encour¬ 
aged by previous experience with the simplifying 
effect of conformal transformations, we now study 
the general conformal transformation properties of 
the left-hand sides of (144). Generalizing (310), we 
write 


hab = ^‘^hab , (323a) 

. (323b) 


Note that in view of (323a) the second equation is 
equivalent to 


Kab = n^-^Kab ■ (323c) 


We first wish to determine the power s that is most 
suitable for simplifying (144b I if written in terms 
of h and K. A slightly lengthy but straightforward 
computation gives 


+ (10 - 

+ {s-6)n-^^+^W^K^. 

(324) 


Here D is the Levi-Civita covariant derivative with 
respect to h and indices on barred quantities are 
moved with the barred metric. A suitable simpli¬ 
fication in the sense of conformal covariance would 
occur if only the first line in (324) survived. The 
other two lines cannot be made to vanish simulta¬ 
neously on account of a suitable choice of s. The 
best one can do is to choose s = 10 and restrict to 
traceless K, i.e. Kf = 0. From (323) this might 
seem as if we had to restrict to traceless K. But 
note that as the left-hand side of (144b) is linear 
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in K we can always add to any traceless solution 
if(i) 

a pure trace part 


= I'rhab : 


(325) 


which satisfies the vector constraint as long as r is 
constant. Putting all this together we see that we 
get a solution to the vector constraint if we main¬ 


tain (323a I but replace (323b) with 


= n- 






Kab = n-^Kab + \n^habT, 


(326a) 

(326b) 


where r is a constant and K is transverse traceless 
in the metric h\ 


= 0 . 


(327a) 

(327b) 


Note that K'^ = r so that the method as presented 
here only produces initial data of constant mean 
curvature. It can be generalized to non-constant r, 
see e.g. [86] . 

As before, the idea is now to let the remain¬ 
ing scalar constraint determine the conformal fac¬ 


tor. Inserting (323a) and (326) into the scalar con¬ 


straint (144a) and using (125), we obtain the fol¬ 


lowing elliptic York equation for 


12 " 

(328) 

Existence and uniqueness of this equation for the 
Lorentzian case e = — 1 is discussed in the survey 
[86] . It may be further simplified if, as before, we 
assume conformally flat intitial data, i.e. 


h = S = flat metric . 


(329) 


Then 

- eAsn + ^n-'^KabK’^^ - = 0. (330) 

where K is now transverse-traceless with respect to 
the flat connection (partial derivatives in suitable 
coordinates). 


It is remarkable that the ADM momenta (291) 


can be calculated without knowing fl. Hence we 


can parametrize solutions to (327) directly by the 
momenta without solving (328) first. Two solu¬ 


tions of particular interest for h = S are the Bowen- 
York data [37] [119] . In Cartesian coordinates and 
corresponding components they read 


"^{^aAb + VbAa - (Sab “ VaT^b)^'' , 
^^ab = r~^{^a£bcd + Ub£acd)B’^v‘^ , 


(331a) 

(331b) 


where := x°‘/r and where all indices are raised 
and lowered with the flat metric <5. A and B are co- 
variantly constant vector fields with respect to the 
Levi Civita connection for the fiat metric (5, which 
are here represented by constant components A^ 
and B‘^. One verifies by direct computation that 


they satisfy (327) with hab = Sab and Da = da- 


Furthermore, using (291) one shows that (331a) 


has vanishing angular momentum and a linear mo¬ 
mentum with components 


2c3 

pa A a 


(332) 


whereas (331b) has vanishing linear momentum 


and an angular momentum with components 


= 30^“ 


(333) 


They can be combined to give data for single holes 
with non-zero linear and angular momenta and 
also be superposed in order to give data for multi 
black-hole configurations. Such data, and certain 
modifications of them, form the essential ingredient 
for present-day numerical simulations of black hole 
scattering and the subsequent emission of gravita¬ 
tional radiation. 


Let us now return to equation (328), which we 


have to solve once suitable expressions for the com¬ 
ponents Kab have been found. As the metric is 
fiat at the end representing spatial infinity, i.e. 
D(r —>■ oo) = 1, it is clear that H cannot be 
bounded in the interior region. The idea of the 
puncture method, first proposed in [33], is to re¬ 
strict the type of singularities of D to be, in some 
sense, as simple as possible, which here means to 
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be of the pure monopole type that we already en¬ 


countered in (312) for a single hole and generalized 


to two holes in (320). This amounts to the follow¬ 
ing: Take S = —{ci, • • • , c„} and assume we are 

given suitable Kab which are regular in S, e.g., a 


sum of York data of the form ( |331 ), each centered 
at one of the punctures q. Accordingly, we write 


n = u -\ -, 

OJ 


where 


1 

UJ 


E tti 

rp . 


(334a) 


(334b) 


again with Vi := ||a? — c)||. The crucial assumption 
now is that the function u is smooth, at least C^, 
on all of including the points c). Since l/w 


is annihilated by A^, (328) applied to (334) then 


leads to a second order elliptic differential equation 
for u. In the simple case of conformally flat max¬ 
imal data, i.e. hab = 5ab and r = 0, we get in the 
Lorentzian case (e = —1) 


= - 1(1-t wu) Lo'^KabK‘^\ (335) 

where u —>■ 1 at spatial infinity. Now, at the i- 
th puncture, Kab diverges as (l/r^)^ for the data 
(331a) and as (l/^i)^ for the data (|331b|). This 


means that KabK°'^ diverges at most as (l/r^)®. 
But from (334b) we see that w vanishes as at c) 


so that KabK°'^ also vanishes at least as fast as 
at Ci and is hence continuous on all of Standard 
elliptic theory now allows to conclude existence and 
uniqueness of solutions to (335); see [38] for 


more details. It is then not difficult to see that the 
Riemannian manifold (S, h) so obtained has n -|- 1 
asymptotically flat ends whose ADM masses are 


readily calculated. Similar to (321), one obtains 


2c^ 

M = 

i=l 

2 c2 f ^ 

Mi = -— I UiUi 4- 2_^ 


G 




j=i 


(336a) 

(336b) 


where Ui := u(ci) and Vij := ||ci — Cj\\. Comparing 
this to (321) (and its obvious generalization from 


2 to n punctures) shows that the only difference 
in the analytic expression for the masses is the ap¬ 
pearance of Ui (instead of 1) in the first term on the 
right-hand side of (336b). Thus, formally, for fixed 
monopole parameters Ui, the switching-on of linear 
and angular momentum (here represented locally 
by trace-free extrinsic curvatures) adds to each in¬ 
dividual mass a term ai{ui — 1). This contribution 
is non-negative as a consequence of u > 1. The 
latter equation follows immediately from standard 
elliptic theory. Indeed, (335) implies A^u < 0, i.e. 


that u is superharmonic, and hence that u > / for 
any continuous harmonic / with the same bound¬ 
ary values, i.e. / = 1. Note that this argument 
relies on e = — 1. 

Finally we wish to point out an interesting type 
on non-uniqueness in writing down initial data of 
the form (331). It has to do with the question of 


whether we wish to enforce the inversion symme¬ 


tries of the type (315) to become isometries of the 
initial geometry. Let us focus on Ii as defined in 
(|315 ), where we now denote the radius of inversion 


a (rather than rg). Dropping the subscript 1, we 
have 

[/(:r)]“ = (aVr'")*“. (337) 


Its Jacobian is 


[A(x)]? = (aVr^)(<56“-2:^V,). 


(338) 


We note in passing that the matrix in round brack¬ 
ets is orthogonal with determinant —1. 

It follows that the conformally flat metric h = 
(5 satisfies I* h = h iS 


(a/r)(floI) =n. 


(339) 


In such a metric the sphere r = a is the fixed-point 
set of the isometry I and hence totally geodesic. 
In particular, this implies that it is a stationary 
point of the area function which is equivalent to 
dr(r^fl^) = 0 and hence to 


an 

dr 


VL 

2a 


= 0 , 


(340) 


which may also be directly verified from differenti¬ 


ating (339) with respect to r at r = a. This would 
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be the condition for (3301 at the “inner boundary' 


in order to produce a solution that gives rise to 
a metric h that has I as an isometry. But that, 
clearly, also puts conditions on the extrinsic curva¬ 
ture K, for h and K have to satisfy the coupled sys¬ 


tem of constraints (144) in the vacuum case T = 0. 


A sufficient condition is 


rx = ±K. 


(341a) 


Using (326b) and restricting to maximal data. 


= r = 0, this is equivalent to 

{a'^/r^)I*K = ±K, 


(341b) 


where we also made use of (339). 

Using the global chart {a;^, x^} on — {0} 
together wit the flat metric h = S, we have the func¬ 
tion r whose value at x is the (5-geodesic distance 
((a;^)^-|-(x^)^-|-(a;^)^) and the following vector fields 
and volume form (A“, 5“, Cabc being constant com¬ 
ponent functions) 


X 

V = — 

r 

A = A“ 

B = 


_^ 

r dx° 
d 
dx°- 
d 
dx° 


£ = hsabcdx’^ A dx^ A . 


(342a) 

(342b) 

(342c) 

(342d) 


The co-vector fields that arise from these vector 
fields via the isomorphism induced by the flat met¬ 
ric 6 are called , A*', Under the inversion-map 


(337), making also use of (338), these structures 


behave as follows: 


I* r = r o I = — 
r 


= —{a/r) 


- 2 . 




2 ,> 


= -{alrYv' 


BA = (a/r) ^(A — 2: 2 (z/• A)) , 

I*A^ = {a/rY{A^ A)) , 


(343) 

(344a) 

(344b) 

(345a) 

(345b) 


and identically for B, where a dot denotes the 
scalar product with respect to the flat metric, i.e. 
V ■ A = S^u, A), 


rS = {a/rYS, 


(346) 


ail' 


C0 


r£ = -{a/rf£. (347) 

These formulae allow to immediatly write down 


the /-transforms of the data (331). We have, in 
components. 


rAT, 


rA(l) 


ah 


= -X ^ {vaAb + VbAa + [Sab “ 5 VaVb)v‘'A^) . 

(348a) 


and 


BK^Yb = -(a/r)-2//(2) . (348b) 

This means that as given by (331b) is already 


antisymmetric in the sense of (341b), but (331a) is 


neither symmetric nor antisymmetric. Symmetric 
or antisymmetric data can be obtained by forming 
the symmetric or antisymmetric combination 


:= ± (a/r)2r/f(i) . 


(349) 


( 1 ) 


(350) 


which satisfies 

{a/rfrR^Y'^ = ±K'^ 

In components they read 

= l/R{vaAb + VbAa - (Sab “ VaVb)v^Ac) 

=F a^/R{vaAb + VbAa -I- (Sab “ 5 VaVb)v^Ac). 

(351) 

Having enforced symmetry with respect to the 


inversion (337) we will obtain an initial-data 3- 


manifold with two isometric asymptotically flat 

straightforward calculation using ( |338[ | first yields 
I*£ = [a/r)^(^€ — 2 A where ★ denotes the Hodge 

dual with respect to 5. But for any vector field u one triv¬ 
ially has A £ = 0 and hence, now assuming u to be also 
normalized, 0 = A e) = s — A i^s. Using iu£ = , 

this gives A = e and hence ||347|l. 
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ends whose Poincare charges coincide (possibly up 
to sign) and are given by those of the original data. 
This is trivially true for angular momentum and 
follows for linear momentum from the 1 /r^ fall-off 
of the second term in (3511. It is interesting to 
note that a term proportional to the second term 
in (351) follows in case of spherically symmetric 
extended matter sources 1361 . 


11 Further developments, 
problems, and outlook 

In this contribution we have explained in some de¬ 
tail the dynamical and Hamiltonian formulation of 
GR. We followed the traditional ADM approach 
in which the basic variables are the Riemannian 
metric h of space and its conjugate momentum tt, 
which is essentially the extrinsic curvature that E 
will assume once the spacetime is developed and E 
is isometrically embedded in it. Attempts to es¬ 
tablish a theory of Quantum Gravity based on the 
Hamiltonian formulation of GR suggest that other 
canonical variables are better suited for the mathe¬ 
matical implementation of the constraints and the 
ensuing construction of spaces of states and observ¬ 
ables [TnirTIlTIfM]. These variables are a (suit¬ 
ably densitized) orthonormal 3-bein held A on E 
and the Ashtekar-Barhero connection. We have al¬ 
ready seen that orientable E are parallelizable so 
that global helds E do indeed exist. Any held 
E determines a Riemannian metric h, which in 
turn determines its Levi-Civita connection. The 
Ashtekar-Barbero covariant derivative, V, differs 
from the Levi-Civita connection D oi h hy the 
endomorphism-valued 1 -form which associates to 
each tangent vector X the tangent-space endomor¬ 
phism Y 7 Wein(X) x Y, where 7 is a dimen¬ 
sionless constant, the so-called Barbero-Immirzi- 
parameter, which was hrst introduced by Immirzi 
in [85] on the basis of Barbero’s generalization [22] 
of Ashtekar’s variables. Hence we have 

VxY = DxY + 7 Wein(X) x Y . (352) 

The multiplication x is the standard 3-dimensional 
vector product with respect to the metric h. It is 


dehned as follows 

X xY ■.= [-k{X'’ AY'')]\ (353) 


where the isomorphisms b and j) are with respect 
to h (cf. 0)- The product x obeys the standard 
rules: It is bilinear, antisymmetric, and X x {Y x 
Z) = /i(X, Z)Y — h{X, Y)Z. Moreover, for any V, 
the endomorphism Y 1 —> A" x E is antisymmetric 
with respect to h, i.e. h{X xY,Z) = —h{Y, XxZ), 
and hence it is in the Lie algebra of the orthogonal 
group of h. In particular this is true for Y 1 —>■ 
Wein(V) x Y, showing that V is again metric, 
i.e. obeys Vh = 0 once its unique extension to 
all tensor fields is understood. Clearly, unlike D, 
the torsion of T) cannot be zero: 

T'^{X, Y) = VxY - VyX - [X, Y\ 

= 7 (Wein(X) x E - Wein(E) x X) . 

(354) 


Using ([77]) and index notation, the curvature tensor 
for V is 

_ tvD 

^abcd ^abcd 

+ ey{D,Kdn - (355) 

-y^{KacKM-KadKbc)- 
From this the scalar curvature follows 

Seal® = Seal® -h 7^ ■ (356) 


Camparison with (I44a I shows that for 7 ^ = e the 
gravitational part of the scalar constraint is just 
{e times) the scalar curvature of V. This striking 
simplification of the scalar constraint formed the 
original motivation for the introduction of V by 
Ashtekar [IS]. However, for £ = —1 one needs to 
complexify the tensor bundle over E for 7 = to 
make sense, and subsequently impose reality con¬ 
ditions which re-introduce a certain degree of com¬ 
plication; see, e.g., [ 68 ] for a compact account not 
using spinors. The usage of V in the real case was 
then proposed by Barbero in [^ and forms the ba¬ 
sic tool in Loop Quantum Gravity |113j . which has 
definite technical advantages over the metric-based 
traditional approach. 
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At this point we wish to inject one word of cau¬ 
tion concerning the possible geometric interpreta¬ 
tion of the of Barbero connection V, depending on 
the value of 7 . From the defining equation (3521 
it is clear that any V explicitly contains extrinsic 
information, i.e. information that refers to the way 
S is embedded into spacetime M. Moreover, un¬ 
less 7 ^ = e, this dependence on the embedding is 
such that T) cannot be considered as pull-back of a 
connection defined on (the bundle of linear frames 
over) M, as has been pointed out in m- The 
argument is simple: If it were the pull-back of such 
a connection on M, its holonomy along a loop in 
spacetime would be the same for all S containing 
that loop. That this is not the case for 7 ^ e can, 
e.g., be checked for the simple example where M is 
Minkowski space and the loop is a planar unit cir¬ 
cle that is contained in a flat spacelike hyperplane 
as well as in the constant-curvature spacelike hy¬ 
perboloid of unit future-pointing timelike vectors. 
The holonomy in the latter case turns out to be 
non-trivial (see |109] for the explicit calculation). 
For the Ashtekar connection, i.e. for 7 ^ = e, we 
know from its original construction that it is the 
pull-back of a spacetime connection (see, e.g., the 
derivation in [551 )■ Barbero’s generaliza¬ 

tions (3521 with 7 ^ ^ e, and in particular for all 
real values of 7 in the Lorentzian case {e = — 1), 
this means that it is impossible to attach a gauge- 
theoretic spacetime interpretation to T). 

Despite this conceptual shortcoming, the techni¬ 
cal advantages over the metric-based approach re¬ 
main. On the other hand, the latter is well suited 
to address certain conceptual problems [ 52 ], like 
e.g. the problem of time that emerges in those 
cases where the Hamiltonian (241) has no bound¬ 
ary terms and is therefore just a sum of constraints. 
This happens in cosmology based on closed E. The 
motions generated by the Hamiltonian are then 
just pure gauge transformations and the question 
arises whether and how ‘motion’ and ‘change’ are 
to be recovered; see, e.g., fT571[T(15| . 

Dynamical models in cosmology often start from 
symmetry assumptions that initially reduce the in¬ 
finitely many degrees of freedom to finitely many 
ones (so-called mini-superspace models). Other 


modes are then treated perturbatively in an expan¬ 
sion around the symmetric configurations. In these 
cases quantization in the metric representation can 
be performed, with potentially interesting conse¬ 
quences for observational cosmology, like the mod¬ 
ification of the anisotropy spectrum of the cosmic 
microwave background [ 53 ] [ 32 ] . All these attempts 
make essential use of the Hamiltonian theory as 
described in this contribution. 


12 Appendix: Group actions 
on manifolds 


Let G be a group and M a set. An action of G on 
M is a map 

^ -.Gx M -)■ M (357) 

such that, for all m G M and e G G the neutral 
element, 

$(e, m) = m, (358) 

and where, in addition, one of the following two 
conditions holds: 


<^{g,<P{h,m)) = <^{gh,m), (359a) 

$((/, $(/i, to)) = ^{hg,m). (359b) 


If ( 357[ ), ( 1358[ ), and ( |359a ) hold we speak of a left 
action. A right action satisfies (357), (358), and 
(359b). For a left action we also write 


^{g,m) =: g ■ m (360a) 


and for a right action 

$( 5 , to) =:m-g. 


(360b) 


Equations (359) then simply become (group mul¬ 
tiplication is denoted by juxtaposition without a 
dot) 


g ■ {h ■ m) = (gh) ■ to , (361a) 

{m ■ h) ■ g = TO • (hg ). (361b) 


Holding either of the two arguments of 4) fixed we 
obtain the families of maps 


: M M 

TO I—)• $(5, to) 


(362) 
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for each g G G, or 

■ G ^ M 
g i-G ^{g,m) 


(363) 


vector field on M, given by 


("T-) = ;77 ^ (exp (tX), to) 


t=o 


dt 

— ^m*€ 


{X). 


(364) 


for each to G M. Note that (358) and (359) imply 
that $g-i = ($g)“^. Hence each $g is a bijec- 
tion of M. The set of bijections of M will be de¬ 
noted by Bij(M). It is naturally a group with group 
multiplication being given by composition of maps 
and the neutral element being given by the identity 
map. Conditions (358) and (359a) are then equiv¬ 
alent to the statement that the map G -G Bij(M), 
given by 5 I—>■ is a group homomorphism. Like¬ 
wise, (358) and (359b I is equivalent to the state¬ 
ment that this map is a group anti-homomorphism. 


The following terminology is standard: The set 
Stab(TO) := {g G G : $( 5 , to) = to} C G is 
called the stabilizer of to. It is easily proven to 
be a normal subgroup of G satisfying Stab(g • 
to) = g(Stah{g ■ m))g~^ for left and Stab(TO • g) = 
5“i(Stab(5f • rn)^g for right actions. The orbit of 
G through to € M is the set Orb(TO) := {$((;, to) : 
g G G} =: 4>(G, to) (also written G • to for left and 
TO • G for right action). It is easy to see that two 
orbits are either disjoint or identical. Hence the 
orbits partition M. A point to G M is called a 
fixed point of the action $ iff Stab(TO) = G. An 
action $ is called effective iff ^{g,m) = m for all 
m G M implies 5 = e; i.e., “only the group identity 
moves nothing”. Alternatively, we may say that ef¬ 
fectiveness is equivalent to the map G >-)■ Bij(M), 
g i-G ^g, being injective; i.e., <I>g = Wm implies 
g = e. The action $ is called free iff $(g,TO) = to 
for some m G M implies g = e; i.e., “no g e fixes 
a point”. This is equivalent to the injectivity of all 
maps '■ G -G M, g $(g,TO), which can be 
expressed by saying that all orbits of G in M are 
faithful images of G. 


Here we are interested in smooth actions. For 
this we need to assume that G is a Lie group, that 
M is a differentiable manifold, and that the map 
(357) is smooth. We denote by exp : TgG —>■ G 
the exponential map. For each X G T^G there is a 


Recall that ^rn*e denotes the differential of the map 
<l>m evaluated at e G G. is also called the 

fundamental vector field on M associated to the 
action $ of G and to A G Tf,G. (We will later write 
Lie(G) for TgG, after we have discussed which Lie 
structure on TeG we choose.) 


In passing we note that from (364) it already 
follows that the flow map of is given by 


Fl}" (to) = <i>(exp(tA), to) . 


(365) 


This follows from exp(sA) exp(tA) = exp((s -|- 
t)A) and (359) (any of them), which imply 


Fir^ oFir^ = FI 




iW 

s+t 


(366) 


on the domain of M where all three maps appear¬ 
ing in (366) are defined. Uniqueness of flow maps 
for vector fields then suffices to show that (365) is 
indeed the flow of . 

Before we continue with the general case, we 
have a closer look at the special cases where M = G 
and 4) is either the left translation of G on G, 
4>(g, ft,) = Lg{h) := gft, or the right translation, 
4>(g, ft) = Rg[h) ■= hg. The corresponding funda¬ 
mental vector fields (364) are denoted by and 
respectively: 


V^{h) 


fL 

dt 

dt 


fexp(tA) ft) , 
t=o V / 



(367a) 

(367b) 


The seemingly paradoxical labeling of R for left 
and L for right translation finds its explanation in 
the fact that is right and is left invariant, 
i.e., Rg*V^ = and Lg^V^ = . Recall that 

the latter two equations are shorthands for 


Rg*hV^{h) = V^{hg), (368a) 

Lg*hV^{h) = V^igh). (368b) 
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The proofs of (368a) only uses (367a I and the chain 
rule: 




dt 


_d 

dt 

d 

dt 


t=o 


t=o 


^exp(fX) h 
^exp(tX) 
^exp(tX) hg 


R, 


= V^{hg). 


(369a) 


Similarly, the proof of (368b) starts from (367b): 


Rg^h^L i]R) — Rg^h 


d 

dt 

d 

dt 


t=o 


t=o 


(ji exp(tX)^ 

Rg(h exp(tX)^ 

(^gh exp{tX)^ 


= V^igh). 

In particular, we have 

V^{g)=Rg,eVi^{e) = Rg,,X, 
V^{g)=Rg.,V^{e) = Rg^.X, 


(369b) 


(370a) 

(370b) 


showing that the vector spaces of right/left invari¬ 
ant vector fields on G are isomorphic to TeG. More¬ 
over, the vector spaces of right/left invariant vector 
fields on G are Lie algebras, the Lie product being 
their ordinary commutator (as vector fields). This 
is true because the operation of commuting vector 
fields commutes with push-forward maps of diffeo- 
morphisms: IT] = (/)*IT]. This implies 

that the commutator of right/left invariant vector 
helds is again right/left invariant. Hence the iso¬ 
morphisms can be used to turn TgG into a Lie al¬ 
gebra, identifying it either with the Lie algebra of 
right- or left-invariant vector fields. The standard 
convention is to choose the latter. Hence, for any 
X,Y G Lie(G), one defines 


[X,y] :=[T/,T/](e). (371) 


TeG endowed with that structure is called Lie(G). 
Clearly, this turns Vl : Lie(G) —)■ TTG, X , 

into a Lie homomorphism: 

= (372) 


As a consequence, Vr : Lie(G) —TTG, X >->• , 

now turns out to be an anti Lie homomorphism, 
i.e., to contain an extra minus sign: 

(373) 

This can be proven directly but will also follow 
from the more general considerations below. 

On G consider the map 

G : G X G^ G 

1 (374) 

(h, 5 ) ^ hgh ^. 

For fixed h this map, Gh '■ G ^ G, g ^ Gh{g) = 
hgh~^, is an automorphism (i.e., self-isomorphism) 
of G. Automorphisms of G form a group (multi¬ 
plication being composition of maps) which we de¬ 
note by Aut(G). It is immediate that the map 
G —)■ Aut(G), h !-)• Gh, is a homomorphism of 
groups; i.e.. 


Ce = IdG , (375a) 

ChoGk = Ghk ■ (375b) 

Taking the differential at e S G of G/j we obtain 
a linear self-map of TgG, which we call Adh- 


Ad/i :— Gh*e ■ TeG —>■ TeG. 


(376a) 


Differentiating both sides of both equations (375) 
at e G G, using the chain rule together with 
Gfe(e) = e for the second, we infer that 


Ade = Wt.g , (376b) 

Ad?i o Adfc = Adhk ■ (376c) 

This implies, firstly, that each linear map ( |376a ) 
is invertible, i.e. an element of the general linear 
group GL(TeG) of the vector space TeG, and, sec¬ 
ondly, that the map 


Ad : G ^ GL(TeG) 
h I—)■ Adh 


(377) 


is a group homomorphism. In other words. Ad is 
a linear representation of G on TeG, called the ad¬ 
joint representation. 
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In (368) we saw that and are invari¬ 
ant under the action of right and left translations 
respectively (hence their names). But what hap¬ 
pens if we act on with left and on with 
right translations? The answer is obtained from 
straightforward computation. In the first case we 
get: 


Lg*h{y]i (h)) — Lgi,h 

_ d 
dt 

- ^ 
dt 


t=o 


t=o 


^exp(tX) h 
(^g exp{tX) 
(Cg{exp{tX)) gh^ 


= V, 


Adg(X) 


R 


(gh): 


(378a) 


where we used (376) in the last and the definition 
of in the first and last step. Similarly, in the 
second case we have 

Rg*h{VL{h)) = Rg*h^ exp{tX) hj 

(378b) 




Taking the differential of Ad at e G G we obtain 
a linear map from T^G into End(reG), the linear 
space of endomorphisms of TeG (linear self-maps 
ofTeG). 


ad := Ad*e : TkG ^ End(TeG) 
X i-G- adx • 


Now, we have 


adx(E) = [A,r] 


(379) 


(380) 


where the right-hand side is defined in (371). The 


proof of (380) starts from the fact that the commu¬ 


tator of two vector fields can be expressed in terms 
of the Lie derivative of the second with respect to 
the first vector held in the commutator, and the 


dehnition of the Lie derivative. We recall from 


(365) that the how of the left invariant vector helds 


is given by right translation: Flj ^ (g) = g exp(fA). 
Then we have 


[A,T] = [Vf,Ef](e) 
= (L^xE/)(e) 
_ _d 
dt 

- !L 

dt 
_ d 
dt 

- ^ 
dt 


,^/lg|.(v'2’(rlE(e)) 


t=0 y^>ds 
d 

t^o ds 


s=0 


Fl^"' (Flp^(e 


exp(fA) exp(sy) exp(—tA) 


t=o 


s=0 

Adexp(tJf)(b^) 


= adx(E). 


(381a) 


A completely analogous consideration, now using 


Fl^^ {g) = exp(tA) g, allows to compute the com¬ 
mutator of the right-invariant vector helds evalu¬ 
ated at e G G: 


t=o 


[V^.V^]{e) = iLy.V^)ie) 
d 
dt 
d 
dt 

- 

dt t=ocls s=o 

A-dexp(-tjs:)(F) 


Flg),(v^(Fir«(e))) 

^ Fir«(Fir«(e 


t=o y*>ds 


s=0 


_d 

t=o ds 


exp{—tX) exp(sy) exp(tA) 


dt 


t=o 


= -adx{Y) 
= -[X,Y]. 


(381b) 


Equation (373) now follows if we act on both 
sides of [V-^ ,V^](e) = —[A,F] with Rg^e and use 
(3^. 


We now return to the general case where M is 
any manifold and the vector held is dehned by 


an action $ as in (364) and whose how map is given 


by (365). Now, given that $ is a right action, we 
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obtain 


a left action we then get 


= {LyxV'^){m) 


d 

dt 

_d 

dt 

d 

dt 

_d 

dt 


FI 


t=o 




d 

-T U_+\^ -r- 


Fir (Fir K 


i=o ^ ^ as s=o 

<i)(exp(tX) exp(sF) exp(—tX), m) 


ds s —0 

= F[A^l(m) 


{y)) 


(382a) 


where we used (365) and (359b I at the fourth and 


(3801 at the last equality. Similarly, if $ is a left 


action, we have 

[V^,V^]{m) 

= {LyxV^){m) 


_d 

dt 

d 

dt 

_d 

dt 

d 


FI 


t=o 


r(F^(Fir(H) 


Fl 




Fir" (Fir (m)) 


4=0 ^ ds s=o 

<i)(exp(—tX) exp(sy) exp(tX), m) 


dt 


t —0 ds s —0 

(Adexp( —tx) (F)) 


(382b) 


$ 


3*m(F^(w)) = ^ (exp(tX) , m) 


t=0 


dt 

d 

dt 


<i )((7 exp(tX), to) 
<^{Cg{exp{tX)),g ■ to ) 


= <i) 


{g-'m)*e 


dt 


t^o 


Cg{exp{tX)) 


= ‘J’(3.m)*e(Adg(X)) 


(383a) 


Similarly, if <& is a right action, 
d 


$ 


g*7n{y^{m)) = $ 


g*m 


dt 


t=o 


$(exp(fX) , to) 


dt 

d 

dt 


$(exp(tX) m) 
$(Cg-i(exp(tX)),TO • g) 


= $ 


{m-g)*e 


dt 




Cg-i{exp{tX)) 


— ^(m g)*e (Adg-l (^)) 

= ■ g) 

= V^^o-y^\<^>(g,m)) . 


(383b) 


where we used (365) and (359a I at the fourth and 
again (380) at the last equality. 


Finally we derive the analog of (378) in the gen¬ 
eral case. This corresponds to computing the push- 
forward of under d)^. If $ is a left action we 
will obtain the analog of (378a), and the analog of 
(378b) if <!> is a right action. For easier readability 
we shall also make use of the notation (360). For 
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constrained Hamiltonian system, 1, 2, 28 
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Hamiltonian, 23 
in GR, 23 
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primary, 29 
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solve (methods, meaning), 40, 43, 53, 56 
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contravariant tensor, 3 
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cosmological constant, 4 
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tensor, 3 

CP-problem, in Quantum Gravity, 43 
curvature 

caused by matter, 6 
extrinsic, 14 
Gaussian, 4, 15, 17 
of spacetime, 6 
Ricci, 17 
Riemann, 16 
scalar, 17 
sectional, 16 
Weyl, 18 

d’Alembertian, see wave operator, 21 
DeWitt metric, 20, 26 
dynamical spacetime, 1 

Einstein 

equations, 1, 4 
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tensor, 4 
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density, 5 
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energy condition 

energy dominance, 6 
strong, 6 
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energy-momentum tensor, 4, 5 
ephemeris time 

gravitational, 47 
Euclidean metric, 3 
Euclidean Quantum Gravity, 3 
Euler Lagrange equations, 27 
evolution of space, 1 
extrinsic curvature, 14 

first class constraints, 29 
first fundamental form, 14 
foliation 

leaves of, 35 
mean curvature, 51 
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of spacetime (by timelike curves), 12 
Frobenius theorem, 33 
function 

energy, 28 
Hamiltonian, 29 
fundamental 

form (first and second), 14 
group (of configuration space), 43 

gauge 

algebra, 34 
group, 36 

redundancies versus symmetries, 29 
transformations, 29 

transformations (asympt. flat case), 49 
York, 25 

Gauss equation, 21 
Gaussian curvature, 4, 15, 17 
geometric object, 40 
geon, 55 


globally hyperbolic, 9 
gravitational constant, 6 
group 

gauge, 36 

of gauge transformations, 36 
group action, 7, 61 
by isometries, 7 
left and right, 7, 61 
group representation 
adjoint, 8, 9, 63 
co-adjoint, 9 
groupoids, 36 

Hamiltonian, 29 
Hamiltonian vector field, 32 
harmonic slicing, 26 
history of space, 1 

ideal, 40 

associative, 34 
Lie, 34 
Poisson, 34 
idealizer. Lie, 34 
initial data, Bowen York, 57 
isotropic submanifold, 33 

Jacobi 

identity, 32, 34 

metric (first and second), 46 

principle, 46 

Killing fields, 7 
kinetic-energy metric, 46 
Komar mass, 52 
Kruskal spacetime, 55 
Kulkarni-Nomizu product, 18 

Lagrangian (Lagrange function), 27 
Lagrangian submanifold, 33 
Laplacian, conformally covariant, 21 
lapse function, 12 
Lie 

algebra, 7, 32-36, 39, 60, 63 
algebra, dual of, 8 
algebroids, 36 

anti-homomorphism, 7, 35, 41 
anti-isomorphism, 63 
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group, 7, 35, 62 
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idealizer, 34 

lift, of diffeomorphims to cotangent bundle, 30 

Loop Quantum Gravity, 60 

Lorentz 

group, 9 
metric, 3, 10 

signature of De Witt metric, 26 
transformations, 9 

mass 

ADM, 50 
Komar, 52 
Tolman, 53 

Maupertuis’ principle, 46 
maximal slicing, 25 
mean curvature, constant, 57 
metric 

connection compatible with, 16 
DeWitt, 20, 26 
Euclidean, 3 
Lorentzian, 3 
on manifold, 3 
Riemannian, 3 
mixed tensor, 3 
momentum 

current-density, 5 
density, 5 
map, 9, 35 

musical isomorphisms, 3 

Newton’s constant, 6 

observables, physical, 29 

parallelizable, 10 
parity conditions, 48 
path integral, 3 
paths independence, 42 
physical 

observables, 29, 34 
phase space, 33 
states, 34 


Poincare 

charges, 2, 60 
group, 48 
Poisson 

algebra (general), 32, 34 
algebra (of physical observables), 34 
bracket, 32 
ideal, 34 

primary constraints, 29 
principal 

curvature directions, 15 
curvatures, 15 
radii, 15 
principle 

Jacobi’s, 46 
Maupertuis’, 46 
of least action (in GR), 36 
of least action (in mechanics), 46 
pull back, 3 
puncture method, 57 
push forward, 3 

quantization, canonical, 2 
Quantum Gravity, 2, 60 

reduced space 

of observables, 34 
of states, 33 
reduction 

of phase space, 40 
program (geometric), 33 
symplectic, 33 
redundancy 
gauge, 2 

gauge (versus symmetries), 29 
in representing spacetime, 1, 2 
Ricci curvature, 17 
Riemann 

curvature, 16 
tensor, 16 

Riemannian metric, 3 

scalar curvature, 17 
Schwarzschild spacetime, 55 
second fundamental form, 14 
secondary constraints, 29 
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sectional curvature, 4, 16, 17 
shape operator, 14 
shift vector-field, 12 
slicing 

harmonic, 26 
maximal, 25 

spacetime (dynamical), 1 
speed of light, 6 
spin structure, 10 
structure 

constants, 36 
functions, 36 
submanifold 

co-isotropic, 33 
isotropic, 33 
Lagrangian, 33 
superspace, Wheeler’s, 43 
supertranslations, odd parity, 49 
surface, constraint, 28 
symmetric hyperbolicity, 24 
symmetries 

gauge (versus redundancies), 29 
symmetry group, asymptotic, 54 
symmetry, of energy-momentum tensor, 5 
symplectic 

morphism, 30 
potential, 30 
reduction, 33 
structure, 30 
systems, constrained, 28 

tensor, 3 

contravariant, 3 
covariant, 3 
Einstein, 4 
energy-momentum, 4 
fields, 3 
mixed, 3 
Riemann, 16 
theorem 

of Frobenius, 33 

of Gauss (theorema egregium), 15 
of Gromov-Lawson, 25, 45 
of Kazdan-Warner, 25, 45 
theta sectors, in Quantum Gravity, 43 
thin-sandwich 


conformal variant, 45 
conjecture, 44 
equations, 44 
problem, 44 

time symmetric data, 54 
Tolman mass, 53 
torsion, 14 
torsion free, 16 
transformation, canonical, 30 

vector constraints, no ideal, 40 
vector field, Hamiltonian, 32 
velocity of light, 6 

wave operator, conformally covariant, 21 
Weingarten map, 14, 15 
well posedness 

of initial-value problem for constraints, 24 
Weyl curvature, 18 

York 

equation, 57 
gauge, 25 
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